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OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



September 3, 2004, 15:56:54 ; Search time 79.2611 Seconds 

(without alignments) 
4199.299 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



1586107 



1586107 seqs, 282547505 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : A_Geneseq_29 Jan04 : * 

1: geneseqpl980s : * 

2 : gene seqp 1990s:* 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5 : geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8: geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
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Score 
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DB 


ID 
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1 


5882 


99.3 


1178 


3 


AAY71311 


Aay71311 


Human 
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5815 


98.2 


1192 


4 


AAU04591 


Aau04591 


Human 
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5815 


98.2 


1192 


5 


ABP68600 


Abp68600 


Human 


4 


5815 


98.2 


1192 


6 


ABR59667 


Abr59667 


Human 


5 


5810 


98.1 


1192 


3 


AAY56967 


Aay56967 


Human 


6 


5810 


98.1 


1192 


4 


AAB82349 


Aab82349 


Human 


7 


5810 


98.1 


1192 


5 


ABG30938 


Abg30938 


Human 


8 


5810 


98.1 


1192 


5 


ABB81078 


Abb81078 


Human 


9 


5526.5 


93.3 


1246 
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AAU33228 
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Novel 
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hum 
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Human MDD 
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74 
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AAY95012 
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Human sec 
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.5 


72 
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3 


AAY71310 


Aay71310 


Rat 


neuri 


13 
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72 


. 5 
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5 


ABB81074 


Abb81074 


Rat 


neuro 
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.5 
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3 


AAY71557 
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Alternati 
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Rat 


Nogo 
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. 1 


642 


2 


AAW58383 


Aaw5 8383 


Human sec 


18 


3146, 
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53 


. 1 
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4 


AAB90682 
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Human BG1 


19 
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45 


. 8 
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3 


AAY71562 


Aay71562 


Rat 


Nogo 


20 
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.5 


42 


. 7 
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3 


AAY71386 


Aay7138 6 


Rat 


Nogo 


21 
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.5 


42 


. 0 
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3 


AAY71391 
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Nogo 


22 
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41 


. 5 


736 


3 


AAY71398 
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Nogo 


23 


2449. 
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41 


. 4 
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3 


AAY71399 
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Nogo 
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Nogo 
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.5 
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3 
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3 
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Nogo 


27 
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3 
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Rat 


Nogo 


28 
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,5 


27 


. 6 
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3 


AAY7138 9 


Aav7 13 8 9 

1 LCI y t _L — ) u > 


Rat 


Nogo 


29 


1566. 
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Nogo 
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ALIGNMENTS 



RESULT 1 
AAY71311 

ID AAY71311 standard; protein; 1178 AA. 
XX 

AC AAY71311; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Human neurite growth inhibitor Nogo. 
XX 

KW Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer ! s disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 



KW structural plasticity; screening. 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualif iers 

FT Misc-dif ference 187 

FT /label= Unknown 

FT Misc-dif ference 188 

FT /label= Unknown 

FT Misc-dif ference 189 

FT /label= Unknown 

FT Misc-dif ference 190 

FT /label= Unknown 

FT Misc-dif ference 221 

FT /label= Unknown 

FT Misc-dif ference 328 

FT /label= Unknown 

FT Misc-dif ference 477 

FT /label= Unknown 

FT Region 977. .1012 

FT /note= "Region specifically described in claim 16" 

FT Region 994. .1174 

FT /note= "Region specifically described in claim 16" 

FT Region 1079. .1114 

FT /note= "Region specifically described in claim 16" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a potent neural 

CC cell growth inhibitor and is. free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The human Nogo 

CC sequence was derived by aligning human expressed sequence tags (ESTs) 

CC e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, AA081525 

CC and AA081840 with the rat Nogo sequence. Nogo proteins and fragments 

CC displaying neurite growth inhibitory activity are used in the treatment 

CC of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 



CC Alzheimer ! s and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperprolif erative or benign 

CC dysprolif erative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1178 AA; 

Query Match 99.3%; Score 5882; DB 3; Length 1178; 

Best Local Similarity 99.7%; Pred. No. 5.4e-296; 

Matches 1175; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 1 MEDLDQSPLVSSSDSVPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPTAP AAGAP LMD FGND EVP PAP RG P LP AAP P VAP E RQ P S WD P S PVS S T VPAP 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 61 AGL S AAP VPTAP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S PVS S T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 

I I I I II I II I II I I I I I II I I I II I I I II I I I I I I I I I II I I I I I I I I I I I M II I I I II 
Db 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 

Qy 241 LSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I 
Db 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

Qy 361 AVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II II I I II II 
Db 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I I 
Db 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSCTDYVTTDNLTKVTEEWANMPEGLTPDL 540 



Qy 



541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 



Db 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Qy 601 VMEAPLNSAVP SAGAS VIQPSSS PLEAS SVN YES I KHEPENPPPYEEAMSVSLKVSGIKE 660 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I II I I I I 
Db 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I 
Db 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 781 GKPYLESFKLSLVNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I 
Db 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

I I I I I II I I I II I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I II I I I 
Db 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

Qy 1081 ALGHWCTIKELRRLFLVDDLVDSLKFAVl^ 1140 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I 
Db 1081 ALGHWCTI KELRRLFLVDDLVDS LKFAVLMWVFTYVGALFNGLTLLI LALI S LFSVPVI 1140 

Qy 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

RESULT 2 
AAU04591 



ID AAU04591 standard; protein; 1192 AA. 
XX 

AC AAU04591; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 



KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis; 

KW Marchiaf ava-Bignami disease; pontine myelinolysis ; adrenoleukoclys trophy; 

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 

KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe's disease. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Domain 1054. .1119 

FT /label= Lumenal_extracellular_domain 

FT /note= "This sequence is specifically claimed" 

FT Peptide 1055. .1094 

FT /label= Pepl 

FT /note- "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1064. .1088 

FT /label= Pep2 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1074. .1098 

FT /label= Pep3 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1084. .1108 

FT /label= Pep4 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1095. .1119 

FT /label= Pep5 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

XX 



PN WO200151520-A2. 
XX 

PD 19-JUL-2001. 
XX 

PF 12-JAN-2001; 2001WO-US001041 . 
XX 

PR 12-JAN-2000; 2000US-0175707P . 

PR 26-MAY-2000; 2000US-0207366P . 

PR 29-SEP-2000; 2000US-0236378P . 
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR N-PSDB; AAS09453. 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders. 
XX 

PS Example 1; Page 101-104; 109pp; English. 
XX 

CC The sequence is the human Nogo protein, a 250kDa myelin-associated axon 



CC growth inhibitor. The invention relates to the use of the nogo receptor, 

CC nogo protein, their nucleic acids, vectors expressing them and antibodies • 

CC against them, to isolate agents which block nogo receptor mediated axonal 

CC growth. The agent is useful for treating a central nervous system 

CC disorder which is a result of cranial or cerebral trauma, spinal cord 

CC injury, stroke or a demyelinating disease selected from multiple 

CC sclerosis, monophasis demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, Marchiaf ava-Bignami disease, 

CC pontine myelinolysis , adrenoleukodystrophy, Pelizaeus-Merzbacher disease, 

CC Spongy degeneration, Alexander's disease, Canavan's disease, 

CC metachromatic leukodystrophy, viral infection and Krabbe 1 s disease 

XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 4; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.6e-292; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | M I I I I I I I I I 
MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

AGL S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP AP 12 0 
I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGL S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L PAAP P VAP ERQPSWDPS P VS S T VP AP 120 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

Ml: I : : II I I I I I I I I I II I I I I II I I I I I I I I I I I I I I 

SSGSVTJETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 24 0 

LSAAS FKEHEYLGNLSTVLPTEGTLQENVSEAS KEVSEKAKTLLI DRDLTEFS ELEYS EM 2 86 

M I I I I I I I I I I I I I II I II I I I I I I I I I I I M I I I I I I I I I I I I I I M I II I I I I I I I I 

LSAAS FKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

GS S FSVS PKAESAVI VANPREEI IVKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVV 34 6 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I 
GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 3 60 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I II I II 
SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 

I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M II 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 80 

PLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFEVAAQDSETDYVTTDNLTKVTE 52 6 
III I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 
PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

II M I I I I I I I I I M I I II I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I 
EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


227 


Db 


241 


Qy 


287 


Db 


301 


Qy 


347 


Db 


361 


Qy 


407 


Db 


421 


Qy 


467 


Db 


481 


Qy 


527 


Db 


541 



Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDS S PDS EPVDLFS DDS I PDVPQKQDETVMLVKES LTETS FESMI 765 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 84 0 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I I I I I I! II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I II 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQMIESIWPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I 1 1 I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVI QAIQKSDEGHPFRAYLES 1065 

I I I I I I II I I I I I I I II I I I I I I II I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI QAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1125 

I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I II I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLI LALI S LFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

M I I I I I I M I I I I I II I I I II I I I I I I I II I I I I I I I I I I I I II I I I I I I I I 
Db 1140 LLI LALI S L FSVPVI YERHQAQI DHYLGLAN KNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 3 
ABP68600 

ID ABP68600 standard; protein; 1192 AA. 
XX 

AC ABP68600; 
XX 

DT 14-JAN-2003 (first entry) 
XX 

DE Human pancreatic cancer expressed protein SEQ ID NO 71. 
XX 

KW Human; pancreas; cancer; gene therapy; vaccine; immuno stimulant ; 

KW cytostatic; tumour. 

XX 

OS Homo sapiens. 
XX 



PN WO200260317-A2 . 
XX 

PD 08-AUG-2002. 
XX 

PF 30-JAN-2002; 2002WO-US0027 81 . 
XX 

PR 30-JAN-2001; 2001US-02 65305P . 

PR 31-JAN-2001; 2001US-0265682P . 

PR 09-FEB-2001; 2001US-0267568P . 

PR 21-MAR-2001; 2001US-0278651P . 

PR 28-APR-2001; 2001US-0287112P . 

PR 16-MAY-2001; 2 001US-02 91631P . 

PR 12-JUL-2001; 2001US-0305484P . 

PR 20-AUG-2001; 2001US-0313999P . 

PR 27-NOV-2001; 2 001US-033362 6P . 
XX 

PA (CORI-) CORIXA CORP. 
XX 

PI Benson DR, Kalos MD, Lodes MJ, Persing DH, Hepler WT, Jiang Y; 
XX 

DR WPI; 2002-627435/67. 

DR N-PSDB; ABV94680. 
XX 

PT New isolated polynucleotide and pancreatic tumor polypeptides, useful for 

PT diagnosing, preventing and/or treating cancer, particularly pancreatic 

PT cancer. 
XX 

PS Claim 2; SEQ ID NO 71; 300pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 

CC any of a group of over 4000 nucleotide sequences (ABV94 62 8-ABV99145 ) ; (b) 

CC complements of (a) ; (c) sequences consisting of at least 20 contiguous 

CC residues of (a) ; (d) sequences that hybridize to (a) , under moderately 

CC stringent conditions; (e) sequences having at least 75% or 90% identity 

CC to (a) ; or (f ) degenerate variants of (a) . Polypeptides (ABP68596- 

CC ABP68 637) encoded by (I) and oligonucleotide can be used to detect cancer 

CC in a patient and compositions comprising polypeptides, polynucleotides, 

CC antibodies, fusion proteins, T cell populations and antigen presenting 

CC cells expressing the polypeptide are useful in treating pancreatic cancer 

CC and stimulating an immune response. The polynucleotides can be used as 

CC probes or primers for nucleic acid hybridisation, in the design and 

CC preparation of ribozyme molecules for inhibiting expression of the tumour 

CC polypeptides and proteins in the tumour cells, in vaccines and for gene 

CC therapy. Note: The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 5; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.6e-292; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 1 I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I I I ! I I I 1 I I I I I I I I I I I I 1 M I I ) I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 61 AG L S AAP VP T AP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQPSWDPS P VS S T VP AP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I 1 I I I I I I II I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

III: I : : I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I II 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I II I I I I I I II I I I I I II I I I I I I I I I I I I I I I II II I I I I I I I I I I II I I I I I I I I I 

Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 S S EKAKD S FNEKRVAVEAPMREE YADFKP FERVWEVKDS KEDS DMLAAGGKI E SNLES KV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 4 81 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I I I I I I I I I I M I II I I I I I II I I I I I I I I I I I I I I I I II I I I M I I I I I I I M I I 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 7 66 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I II M I I I I II II I I I I I I I I I I I II I I I I I I I II I I I I I I I II I I I II I I I I II I I I I 

Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 



Qy 



886 7^NAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 



Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I 1 

Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFL'LLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDD'LVDSLKFAVLMWVFTYVGALFNGLT 1125 

I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I i I I I I I I II I I I I I I I I I I 1 I I 

Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVIJytWVFTYVGALFNGLT 1139 



Qy 


1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 


Db 


1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 


RESULT 4 


ABR59667 


ID 


ARR59667 standard* nrot-pin* 119? AA 


XX 




AC 


ABR59667 ; 


XX 




DT 


22-JUL-2003 (first entry) 


XX 




DE 


Human NogoA protein. 


XX 




KW 


Human; Nogo receptor; NgR; CTS domain; neuroprotective; gene therapy; 


KW 


axonal growth; central nervous system; CNS; Nogo; spinal cord injury; 


KW 


cranial trauma; cerebral trauma; spinal trauma; stroke; Krabbe 1 s disease; 


KW 


demyelinating disease; multiple sclerosis; monophasic demyelination; 


KW 


encephalomyelitis; multifocal leukoencephalopathy ; panencephalitis. 


XX 




OS 


Homo sapiens. 


XX 




PN 


WO2003031462-A2. 


XX 




PD 


17-APR-2003. 


XX 




PF 


04-OCT-2002; 2002WO-US032007 . 


XX 




PR 


06-OCT-2001; 2001US-00972599 . 


XX 




PA 


(UYYA ) UN IV YALE. 


XX 




PI 


Strittmatter SM; 


XX 




DR 


WPI; 2003-393433/37. 


DR 


N-PSDB; ACC81048. 


XX 




PT 


New human Nogo receptor polypeptides and nucleic acids, useful for 


PT 


decreasing inhibition of axonal growth by a central nervous system 


PT 


neuron, or in treating central nervous system disease, disorder or 


PT 


injury, e.g. spinal cord injury. 



PS Disclosure; Page 131-135; 148pp; English. 
XX 

CC The invention relates to a novel nucleic acid encoding a polypeptide 

CC comprising amino acid residues 27-309 of a 473 amino acid sequence (PI, 

CC human Nogo receptor (NgR) NTLRRCT domain), or residues 27-309 of PI with 

CC 1-20 conservative amino acid substitutions, and less than a complete CTS 

CC domain, provided that a partial CTS domain, if present, consists of no 

CC more than the first 39 consecutive residues. The nucleic acid of the 

CC invention has neuroprotective activity. The polynucleotide may have a use 

CC in gene therapy. The nucleic acid is useful for decreasing inhibition of 

CC axonal growth by a central nervous system (CNS) neuron. The NgR 

CC polypeptide or an agent inhibits the binding of Nogo to NgR or NgR- 

CC dependent signal transduction in the central nervous system neuron may be 

CC used in treating central nervous system disease, disorder or injury, e.g. 

CC spinal cord injury. Expression of an NgR protein may be associated with 

CC inhibition of axonal regeneration following cranial, cerebral or spinal 

CC trauma, stroke or a demyelinating disease, such as multiple sclerosis, 

CC monophasic demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, or Krabbe's disease. The present 

CC sequence is used in the exemplification of the invention 

XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 6; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.6e-292; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPT AP AAGAP LMD FGND EVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I ! ! II I I I I I I I I I I I 
Db 61 AGL S AAP VPT AP AAGAP LMD FGND FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 

Qy 121 S P L S AAAVS P S KL P EDDE P PARP P P P P PAS VS P QAE P VWT P P APAPAAP P S T PAAP KRRG 180 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

Ml: I : : I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I 1 I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I II I I I I I I 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 S SEKAKDS FNEKRVAVEAPMREEYADFKPFERWEVKDS KEDS DMLAAGGKI ESNLES KV 4 06 

I I I I I I I I I I I I I I i I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I t II I I I I II I 
Db 361 S SEKAKDS FNEKRVAVEAPMREEYADFKPFERVWEVKDS KEDS DMLAAGGKI ESNLES KV 420 



Qy 



4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 



Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

III I I I II I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I I I I I I II I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I II I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

t II I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 72 0 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I M I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 78 0 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I II I I II I I 1 I I I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 

Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

II I I 1 I I I I I I I I I I I I I M I II I I II I I II I I I I I I I I I I I I II I I I I I I I I I I I II II 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 107 9 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1125 

I II I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I II 
Db 1080 E VAI SEE L VQ K Y S N SAL GH VN C T I K E L RRL FL VD D LVD S L K FAVLMWVFT YVG AL FN G LT 1139 

Qy 1126 LLIIALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 

RESULT 5 
AAY56967 

ID AAY56967 standard; protein; 1192 AA. 
XX 

AC AAY56967; 



XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide. 
XX 

KW MAGI protein; neuroendocrine-specif ic protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective ; neuroprotective. 

XX 

OS Homo sapiens. 
XX 

PN WO200005364-A1. 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB002360 . 
XX 

PR 22-JUL-1998; 98GB-00016024 . 

PR 19-JUL-1999; 99GB- 000168 98 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR N-PSDB; AAZ56886. 
XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders. 
XX 

PS Claim 2; Page 20-21; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents the human MAGI protein 

XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 3; Length 1192; 
Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEDLDQSPLVS S SDS PPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VP T AP AAGAP LMDFGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I I I I I I I I I I I I II I 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 



Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

II I I I I I I I I 11 I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA VVXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

III: I : : I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II I I I I I I I I I I I I I I I 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEY7VDFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 361 S S EKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDS KEDSDMLAAGGKI ESNLES KV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 1 I I I I I II I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

I II I I I I I I I I I I I I I I I I I II I I I M I I II II I I I I I II I I I I I I I I I I I I I II I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I II II I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 EESEATPSPVLPDIVNEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 7 65 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I M I II I I I I I M I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I II II I I I I I I I I II 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I M I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 



Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I : I II I I I I I I II I I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI QAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1125 

I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I II I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I II I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 6 
AAB82349 

ID AAB82349 standard; protein; 1192 AA. 
XX 

AC AAB82349; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A protein. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; brain injury; 

KW stroke; neuronal degeneration; Alzheimer's disease; Parkinson's disease; 

KW neuromuscular disorder; psychiatric disorder; developmental disorder; 

KW neuroprotective; nootropic; neuroleptic; antiparkinsonian; 

KW cerebroprotective; neuroleptic; diagnosis; therapy. 
XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2 000WO-GB004345 . 
XX 

PR 15-NOV-1999; 99GB-00026995 . 

PR 24-JAN-2000; 2000GB-00001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90324. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases. 
XX 

PS Disclosure; Page 26-27; 25pp; English. 
XX 



CC The present sequence is that of human NOGO-A. NOQO-A is a previously 

CC known splice variant of the human NOGO gene on chromosome 2p21. The 

CC invention relates to a novel splice variant, NOGO-C (see AAB82348). It 

CC provides NOGO-C polypeptides and polynucleotides, and methods for 

CC producing such polypeptides by recombinant techniques. Also disclosed are 

CC methods for utilising NOGO-C polypeptides and polynucleotides in the 

CC treatment of diseases including neuropathies, spinal injury, brain 

CC injury, stroke, neuronal degeneration, for example Alzheimer's disease 

CC and Parkinson's disease, neuromuscular disorders, psychiatric disorders 

CC and developmental disorders. Also provided are methods for identifying 

CC agonists and agonists for use in treating conditions associated with NOGO 

CC -C imbalance, and diagnostic assays for detecting diseases associated 

CC with inappropriate NOGO-C activity or levels 

XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 4; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 1 I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPT AP AAGAP LMD FGN D FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VPAP 120 

I I I 1 I M I I I I I I I I I I I I I I I' I I I I i I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I 

Db 61 AG L S AAP VP TAP AAGAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ PSWDPSPVSS T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

II I 1 I M I I I I I I I I II I I I I I I 1 I I I I II I I I M M I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

III: I : : I I I I I I I I I I M I I I I M I I I I I I I I I I I I I II 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 2 86 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I i I I I I I I I II I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I II I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

Ml I I I I II I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I II I I I I II I I I I I I I 
Db 4 81 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 



Qy 



527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 



I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 1 I I I I II I I I I I I I I I I 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-PCVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I I II I I I i I I I I I I t II I I I II I I I I II I I I I I I I I I I I I I I I I I II 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 72 0 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 84 0 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I II I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I II I I I I I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I II I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1139 

Qy 112 6 LLI LALI SLFSVPVI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1140 LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1192 



RESULT 7 
ABG30938 

ID ABG30938 standard; protein; 1192 AA. 
XX 

AC ABG30938; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE Human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

KW neuroblastoma; hyperprolif erative disorder; dysprolif erative disorder; 



KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 

KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 

KW Nogo-associated disease; metastasis. 
XX 

OS Homo sapiens . 
XX 

PN WO200257483-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB000228 . 
XX 

PR 18-JAN-2001; 2001GB-00001312 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 

DR N-PSDB; ABK90134. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprolif erative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides. 

XX 

PS Disclosure; Page 59-62; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperprolif erative or dysprolif erative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present amino acid sequence represents the human NogoA 

CC protein of the invention 

XX 

SQ Sequence 1192 AA; 



Query Match 98.1%; Score 5810; DB 5; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I II ! I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VP T AP AAGAP LMD FGN D FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 

I I I I I I I I I I II I I II I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 61 AGL S AAP VPT APAAGAP LMD FGN D FVP PAP RGP L PAAP P VAP E RQ P S WD P S P VS S T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I II I I I I I I I M I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

III: I : : I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LSAAS FKEHEYLGNLSTVLPTEGTLQENVSEAS KEVSEKAKTLLI DRDLTEFS ELEYS EM 300 

Qy 287 GS S FS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

I II I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEVV 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I II I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

III I I I I I I I I I II I I I II I I II I I I I I I I I I I I II I I I I I I I II I I I II I I I I I I I 
Db 4 81 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I II I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 7 05 

I I I I I I I I II I II I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I II 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I II I I I I I I II I I I I I I I I I I II I I I I I I I I I II I M I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 



Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 7\NAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

M I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I II I : I I I 1 I I I I I I I II I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I II I I I I I I II I I I II II I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVT^ 1125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG7VLFNGLT 1139 



Qy 


1126 LLIIALISLFSVPVIYERHQAQIDHYLGLANKWKDAMAKIQAKIPGLKRKAE 1178 




1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 


1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 


RESULT 8 


ABB81078 


ID 


ABB81078 standard; protein; 1192 AA. 


XX 




AC 


ABB81078; 


XX 




DT 


05-NOV-2002 (first entry) 


XX 




DE 


Human neurotransmitter receptor protein Nogo-A. 


XX 




KW 


Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 


KW 


central nervous system; peripheral nervous system; tranquillizer; Nogo; 


KW 


vulnerary; cerebroprotective ; anti-tumour; antidiabetic; anticonvulsant; 


KW 


nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 


KW 


osteopathic; vasotropic; nephrotropic ; cytostatic; antigen; gene therapy 


KW 


neurotransmitter receptor; human; receptor. 


XX 




OS 


Homo sapiens. 


XX 




PN 


US2002072493-A1. 


XX 




PD 


13-JUN-2002 . 


XX 




PF 


28-JUN-2001; 2 001US-0089334 8 . 


XX 




PR 


19-MAY-1998; 98IL- 0012 4500 . 


PR 


21-JUL-1998; 98WO-US0147 15 . 


PR 


22-DEC-1998; 98US-0 0218277 . 


PR 


19-MAY-1999; 99US-00314161 . 


XX 




PA 


(YEDA ) YE DA RES & DEV CO LTD. 


XX 




PI 


Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 



PI Moalem G; 
XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides . 

XX 

PS Example; Page 53-56; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates ) , Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the human 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 5; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I II I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPT AP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQPSWDPS P VS S T VPAP 120 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I II I I I I I II I II I 
Db 61 AGL S AAP VPTAP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQPSWDPS P VS S T VP AP 120 



Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 180 



Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

QY 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

IN: I : : I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

QY 227 LS7^ASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

M I I I I I I I I I I I II II I II I I I I I I II II M I I I I I I I II I I I I I I I I I I I I | | | | | M 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEVV 360 

Qy 347 S S EKAKD S FN E KRVAVEAPMREE YAD FK P FERVWE VKD S KED S DMLAAGGK I E S N L E S KV 406 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I II I 11 I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAP FNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

I I I I I I I I I I I I I I I I I I I I I I I II I I I M I II I I I I II II I I I I I I I I I I | I I | | | 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I M I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | M I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENIN7^ALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 7 66 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I II I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I | | : | | || | | | | | | | | | | | | | | | 



Db 



960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 



Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 
M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 | | | | | | | | | | | | | | | | | | M | | | 
VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


1079 


Qy 


1066 


EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | I I | | | | | | | | | | | | | | | | | M II I 1 

EVAISEELVQKYSNSALGHWCTIKELRRLFLA^DLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


1139 


Qy 

Db 


1126 
1140 


LLI LALI SLFSVPVI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 





RESULT 9 
AAU33228 

ID AAU33228 standard; protein; 1246 AA. 
XX 

AC AAU3322 8; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Novel human secreted protein #3719. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement; 

KW stem cell proliferation; haematopoiesis ; nerve tissue regeneration; 

KW immune suppression; immune stimulation; anti-inflammatory; leukaemia. 

XX 

OS Homo sapiens . 
XX 

PN WO200179449-A2. 
XX 

PD 25-OCT-2001. 
XX 

PF 16-APR-2001; 2001WO-US008 656 . 
XX 

PR 18-APR-2000; 2000US-00552929 . 

PR 26-JAN-2001; 2001US-00770160 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-611725/70. 
XX 

PT Nucleic acids encoding a range of human polypeptides, useful in genetic 

PT vaccination, testing and therapy. 

XX 

PS Claim 20; Page 737; 765pp; English. 
XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 



CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising the 

CC nucleic acids encoding the polypeptides and cells genetically engineered 

CC to express them are also useful for producing the proteins. The proteins 

CC are useful in genetic vaccination, testing and therapy, and can be used 

CC as nutritional supplements. They may be used to increase stem cell 

CC proliferation; to regulate haematopoiesis ; and in bone, cartilage, tendon 

CC and/or nerve tissue growth or regeneration; immune suppression and/or 

CC stimulation; as anti-inflammatory agents; and in treatment of leukaemias. 

CC AAU29510-AAU33304 represent the amino acid sequences of novel human 

CC secreted proteins of the invention 

XX 

SQ Sequence 1246 AA; 

Query Match 93.3%; Score 5526.5; DB 4; Length 1246; 

Best Local Similarity 93.0%; Pred. No. 1.5e-277; 

Matches 1123; Conservative 13; Mismatches 39; Indels 33; Gaps 10; 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 
I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I | I | 
MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 101 

AGL S AAP VP T APAAGAP LMD FGNDFVP PAP RG P L PAAP P VAP ERQPSWDPS P VS S TVPAP 12 0 
I I M I I I I I I I I I I I M M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I | I | 
AGL S AAP VP T APAAGAP LMDFGND FVP PAP RG P L PAAP P VAP ERQPSWDPS P VS S TVPAP 161 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 
I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I || I 
SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 221 

SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

I I I : I : : I I I I I M I I I I I I I I I II I I I I I I M I I I I M I 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 281 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 28 6 
I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I II I I 
LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 341 

GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M II I I I I I I I I I I I I I I I I I I I II 
GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 4 01 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 
I I I I I I I I M I I I I I I II I I II I I I I I I I || I I | || I I I I I II I I I I I I I I I I I I II I I I 
SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 61 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I || I I || I I I I | || | | | | | | | | | | | | | | | | 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 521 

PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 
IN I I I M I I I I I I I I I I I I I I I II I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I 
PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 581 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I II I II I I I I I I I I I I | I | I I I 
EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 641 



Qy 


1 


Db 


42 


Qy 


61 


Db 


102 


Qy 


121 


Db 


162 


Qy 


181 


Db 


222 


Qy 


227 


Db 


282 


Qy 


287 


Db 


342 


Qy 


347 


Db 


402 


Qy 


407 


Db 


462 


Qy 


467 


Db 


522 


Qy 


527 


Db 


582 



Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 642 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 701 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I M I I M I I I I I I I I I I I II I I I I I I I I I I II M I II I I I I I I I I I I I I I I I I I I I I 
Db ' 702 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 761 

Qy 7 06 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I M I I I I I I II I I I I I I M II I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I 
Db 762 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 821 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I M M I I M II I I I I I I I I II I I I I I I I I I I I I I I I I I || | | | | | | | | | | | | 
Db 822 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 881 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I II I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 882 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 941 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I M I M I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I II I I I I I II I I 
Db 942 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 1000 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I : I I I I II I I I I I I I II I I I I 
Db 1001 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1060 

Qy 1006 VFGAS-LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY — 1062 

I I I I I : I I I I I I I I I I I I I I I I I I I I I || I I I I I I I I I I I | | | | | | | | | | | | | | | | 
Db 1061 VFGASAVFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAISG 1120 

Qy 1063 -LESEVAISEELVQKYSNSALGHV-NCTIKELRR L FL VD D LVD S L K - FAVLMWVFT Y 1116 

II I : : I : I I I I I I I : I I I : I I I I I I I I I : I M I I I I I I I 

Db 1121 NLESCLYLRELGSGRYSNSALGSMWNCTVKGNFRAPSFFSWMDLVDSLRSFAVLMWVFTY 1180 

Qy 1117 VGALFNGLTLL ILALI SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKI 1170 

Ml M : I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1181 VGCL — GLMVLDTTGFWALNFI S S SGSWLI YERHQAQI DHYLGLANKNVKDAMAKI QAKI 1238 

Qy 1171 PGLKRKAE 1178 

I I I I I I I I 
Db 1239 PGLKRKAE 1246 



RESULT 10 
ABU11573 

ID ABU11573 standard; protein; 983 AA. 
XX 

AC ABU11573; 
XX 

DT 12-FEB-2003 (first entry) 
XX 

DE Human MDDT polypeptide SEQ ID 52 0. 
XX 

KW MDDT; human; disease detection and treatment molecule polypeptide; 



KW anti-inflammatory; immunosuppressive; osteopathic; cytostatic; anti-HIV; 

KW haemostatic; nephrotropic; antianaemic; antipsoriatic; hepatotropic; 

KW gene therapy; protein replacement therapy; cell proliferative disorder; 

KW cancer; adenocarcinoma; leukaemia; lymphoma; melanoma; myeloma; sarcoma; 

KW anaemia; Crohn ! s disease; acquired immunodeficiency syndrome; AIDS; 

KW Goodpasture's syndrome; inflammation; osteoporosis; thrombocytopaenia ; 

KW psoriasis; hepatitis. 
XX 

OS Homo sapiens. 
XX 

PN WO200279449-A2. 
XX 

PD 10-OCT-2002 . 
XX 

PF 27-MAR-2002; 2002WO-US009944 . 
XX 

PR 28-MAR-2001; 2 001US-0279619P . 

PR 2 9-MAR-2001; 2001US-0280067P . 

PR 29-MAR-2001; 2 001US-028 0068 P . 

PR 16-MAY-2001; 2001US-0291280P . 

PR 17-MAY-2001; 2001US-0291829P . 

PR 17-MAY-2001; 2001US-0291849P . 

PR 19-JUN-2001; 2 001US-0299428P . 

PR 20-JUN-2001; 2 001US-0299776P . 

PR 20-JUN-2001; 2001US-0300001P . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Daffo A, Jones AL, Tran AB, Dahl CR, Gietzen D, Chinn J; 

PI Dufour GE, Hillman JL, Yu JY, Tuason 0, Yap PE, Amshey SR; 

PI Daugherty SC, Dam TC, Liu TF, Nguyen DA, Kleefeld Y, Gerstin EH; 

PI Peralta CH, David MH, Lewis SA, Chen AJ, Panzer SR, Harris B; 

PI Flores V, Marwaha R, Lo A, Lan RY, Urashka ME; 

XX 

DR WPI; 2003-058431/05. 

DR N-PSDB; ABX34563. 
XX 

PT New purified disease detection and treatment molecule proteins and 

PT polynucleotides, useful for diagnosing, treating or preventing cancers 

PT (e.g. leukemia or sarcoma), anemia, Crohn's disease, AIDS, osteoporosis 

PT or hepatitis. 

XX 

PS Claim 27; SEQ ID NO 520; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule 

CC polypeptide (MDDT) which has anti-inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic, 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotides 

CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia, psoriasis or 

CC hepatitis. ABU11450-ABU11845 represent the MDDT polynucleotides encoded 



CC by ABU11450-ABU11845, described in the disclosure of the invention. NOTE: 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC ftp . wipo . int/pub/published_pct_sequences 

XX 

SQ Sequence 983 AA; 

Query Match 77.0%; Score 4560; DB 6; Length 983; 

Best Local Similarity 97.1%; Pred. No. 1.2e-227; 

Matches 919; Conservative 8; Mismatches 11; Indels 8; Gaps 3 

QY 240 NLSTVL PTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVS 293 

I : : I : : I : : = I I I I I II I I I I i I I I I I I I I I I I I I I 1 I I I I II II I I I I I 

Db 39 NMNTLVICQQYYPLKEHFKKNVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVS 98 

Qy 294 PKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKD 353 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I | | | | M I I 
Db 99 PKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKD 158 

Qy 354 SFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFAD 413 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II II I I I I I I I I I I I I I I || I I I I 
Db 159 SFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFAD 218 

Qy 414 SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPT 473 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II M I I I III 
Db 219 SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLGDPT 278 

Qy 474 SENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMP 533 

Ml I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I I I || I | | | | | | | | 
Db 279 SENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMP 338 

Qy 534 EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATP 5 93 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M 
Db 339 EGLTPDLVQEACESELNEVT.GTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATP 398 

Qy 594 SPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL 653 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 399 SPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL 458 

Qy 654 -KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 712 

I M I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 459 KKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 518 

Qy 713 VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 772 

I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | 
Db 519 VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 578 

Qy 773 LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 832 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || 
Db 579 LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 638 

Qy 833 SKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 892 

I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II 
Db 639 SKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 698 

Qy 893 GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIES 952 

I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I II I I I I I I I I I I II I I 



Db 



699 GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA-TQAEIES 757 



Qy 953 IVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLF 1012 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I I I I I I I I I I I I I I | | | | 
Db 758 IVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLF 817 

Qy 1013 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 1072 

I I I I I M I I I I I I I I I I ! I I I I I I I I I I I I I | I I I I I I I | | | | M I I I II I I I I I I I I I I 
Db 818 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 877 

Qy 1073 LVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTWG7VLFNGLTLLILALI 1132 

I M I I I I I I I I I I I I I I I I I II I I || I I M | | | | | | | | | | | | M | | | || | | | | | | | | | | | 
Db 878 LVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALI 937 

Qy 1133 SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I 
Db 938 SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 983 



893 AA. 

AAY95012; 

19-JUN-2000 (first entry) 

Human secreted protein vb22_l, SEQ ID NO: 64. 

Human; secreted protein; cancer; tumour; cardiovascular disorder; 
blood disorder; haemophilia; autoimmune disease; diabetes; inflammation; 
infection; fungal; bacterial; viral; HIV; allergy; arthritis; 
neurodegenerative disease; asthma; contraceptive. 

Homo sapiens. 

WO200011015-A1. 



RESULT 11 
AAY95012 

ID AAY95012 standard; protein; 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
N 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
-PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
XX 



02-MAR-2000. 
24-AUG-1999 

24-AUG-1998 

24- AUG-1998 
09-SEP-1998 
28-SEP-1998 

25- NOV-1998 
23-DEC-1998 
23-DEC-1998 
23-AUG-1999 



99WO-US019351. 

98US-0097 63 < 8p". 
98US-0097659P. 
98US-0099618P. 
98US-0102092J 
98US-0109978P. 
98US-0113645P. 
98US-0113646P. 
9-9US-00379246. 



(ALPH-) ALPHAGENE INC. 
Valenzuela D, Yuan O, 
WPI; 2000-224657/19. 



Hoffman H, Hall J, Rapiejko P; 



PT New secreted or transmembrane proteins and polynucleotides encoding them, 

PT useful for treating neurodegenerative disorders, autoimmune diseases and 

PT cancer. 
XX 

PS Claim 73; Page 322-325; 357pp; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94 981-Y95020 ) , 

CC and cDNA sequences encoding them (AAA2342 3-A23462 ) . The secreted proteins 

CC of the invention include those that are thought to be only partially 

CC secreted, i.e., transmembrane proteins. The proteins of the invention may 

CC exhibit one or more activities selected from the following: cytokine 

CC activity; cell proliferation; differentiation; immune modulation; 

CC haematopoiesis regulation; tissue growth activity; activin/inhibin 

CC activity; chemotactic/chemokinetic activity; haemostatic and thrombolytic 

CC activity; anti-inflammatory activity; and tumour inhibition activity. The 

CC proteins may be administered to patients as vaccines, and the nucleotides 

CC may be used as part of a gene therapy regime. Diseases or conditions that 

CC may be treated using the proteins or nucleotides of the invention include 

CC autoimmune diseases; genetic disorders; haemophilia; cardiovascular 

CC diseases; cancer; bacterial, fungal and viral infections, especially HIV; 

CC multiple sclerosis; rheumatoid arthritis; pulmonary inflammation; 

CC Guillain-Barre syndrome; insulin dependent diabetes mellitus; and 

CC allergic reactions such as asthma and anaemia. They may also be used for 

CC treating wounds, burns, ulcers, osteoporosis, osteoarthritis, periodontal 

CC diseases, Alzheimer's disease, Parkinson's disease, Huntington's disease 

CC and amyotrophic lateral sclerosis (ALS) . Proteins with activin/inhibin 

CC activity may additionally be useful as contraceptives. Nucleic acid 

CC sequences of the invention may be used in chromosome mapping, and as a 

CC source of diagnostic primers and probes. The present sequence represents 

CC one of the 40 proteins of the invention 

XX 

SQ Sequence 8 93 AA; 

Query Match 74.3%; Score 4400; DB 3; Length 893; 

Best Local Similarity 98.9%; Pred. No. 2e-219; 

Matches 884; Conservative 1; Mismatches 7; Indels 2; Gaps 2; 

Qy 286 MGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 1 MGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEV 60 

Qy 346 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESK 405 

I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I | I I M I I I I I I I I I 
Db 61 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESK 120 

Qy 406 VDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNI 4 65 

I M I M I I I I I I I I I I I I I I I 1 I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I | I M 
Db 121 VDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNI 180 

Qy 4 66 FPLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVT 52 5 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | I I || | || | | | I | | 
Db 181 FPLLGDPTSENKTDEKKIEEKKAQI VTEKNTSTKTSNPFLVAAQDSETDYWTDNLTKVT 240 

QY 526 EEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS 585 

I I I I I I I M I I I I I I I I I I I I I I I I I I 1 I I I I I II I I II I I I I II I I I I I I I I I I I I I I I 
Db 241 EEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS 300 



Qy 586 FEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPY 645 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 1 II I I I I I I I I I I I I II I I I I I 
Db 301 FEESEATPSPVLPDIVTVIEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPY 360 

Qy 646 EEAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYS 704 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I II I I 
Db 361 EEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYS 420 

Qy 705 EMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESM 764 

I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I || || || I I I I I I I I I I | I I 
Db 421 EMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESM 480 

Qy 765 IEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAV 824 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I II I I I I I I I I I I I I I I I I I 
Db 481 IEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAV 540 

Qy 825 YSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSE 884 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I | | | | 
Db 541 YSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSE 600 

, Qy 885 IANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALG 944 

r V 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 

O r> Db 601 IANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA 660 

/V, Qy 945 HTQAEI ES I VKPKVLEKEAEKKLPS DTEKEDRS PSAI FSADLGKT S WD^LYWRDI KKTG 1004 

>: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 

Db 661 -TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDll] 



1 1 1 1 1 1 1 1 1 

LYWRDIKKTG 719 



Qy 1005 WFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1064 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I II I I I I I I II 

Db 720 WFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 779 

Qy 1065 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1124 

I II M I I I I I I I I 1 I I I II I I I I I I I I I I I I 1 I I I I II I I I I I I || I I I I I I I I I | | I | | 

Db 780 SEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 839 



Qy 1125 TLLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 

I I I I I I I I II I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 84 0 TLLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 



1178 
893 



RESULT 12 
AAY71310 

ID AAY71310 standard; protein; 1163 AA. 
XX 

AC AAY71310; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 



KW 


structural plasticity; screening. 








XX 












OS 


Rattus sp. 










XX 












FH 


Key 


Location/Qualifiers 








FT 


Inhibitory- site 


1. .171 








FT 




/note= "Inhibits NIH 3T3 fibroblast spreading 1 ' 




FT 


Modified-site 


30 








FT 




/note= "Casein kinase II site 


IT 






FT 


Region 


31. .58 








FT 




/ note= "Acidic region" 








FT 


Region 


31. .57 








FT 




/note= "Region specifically described 


in claim 16 


T 


FT 


Region 


172. .259 








FT 




/note= "This region is not essential 


for inhibitory 


FT 




CIL/L1 VI L- y 








FT 


Modified-site 


233 








FT 




/nn1"P= "Profpi n Irin^c:*^ C I PKT* 


\ c -i +■ o " 
i oXLc 






FT 


Modi f ied-site 


*i ■ • ^ *i *i 








FT 




/noi"P= 11 A n i ci M — rr 1 \7 r* o \7 1 afpH 
/ noil _i_ o in u j. y i >uo y xa L-cu 


t 






FT 


Modified-site 


291 








FT 




/ ti n t~ F 1 — "Prntpi n V H n^c;p f (PKC 


1 bXLc 






FT 


Modified- site 


2 95 








FT 




/nnt"P= " P mfpi n lri na co P / p ire* 

f IIULC — IT-LvJ L.CXU J\. _L 1 1 CI O C ^ ^ E 1\0 


SlLc 






FT 


Misc-dif f erence 


404 








FT 




/note= "F.nroHpd bv TTH" 








FT 


Modi f ied-site 


436 








FT 




/notp= "Protpin H na^p C ( PKC 


site" 






FT 


Modi f ied-site 


468. .470 








FT 




/ not"P= "A<?n i s N— rrl vrn^vl ^t^H' 

/ iiv-/ l.v — 1 1 _i_ o in uj.yuWk3yxciL.Cvl 








FT 


Modi f ied-site 


484 








FT 




/note= "Protein Vi nasp C (PKC 


site" 






FT 


Modi f ied-site 


488 








FT 




/note= "Protein kinase C f PKC 


site " 






FT 


Modified- site 


502 








FT 




/note— "Casein kinase TT ^ifR 1 








FT 


Inhibitory- site 


542. .722 








FT 


Modi f ied-site 


576 








FT 




/note= "Casein kinase II site' 








FT 


Peptide 


623. .640 








FT 




/note= "used as immunogen to c 


jenerate 


antibody AS 


472" 


FT 


Modif ied-site 


626 








FT 




/note= "Protein kinase C (PKC) 


site" 






FT 


Modified- site 


694. .696 








FT 




/note= "Asn is N-glycosylated' 








FT 


Modif ied-site 


715 








FT 




/note= "Casein kinase II site' 








FT 


Peptide 


762. .1163 








FT 




/note= "used as immunogen to c 


jenerate 


antibody AS 


Bruna" 


FT 


Modif ied-site 


784 








FT 




/note= "Protein kinase C (PKC) 


site" 






FT 


Modif ied-site 


821 








FT 




/note= "Protein kinase C (PKC) 


site" 






FT 


Modif ied-site 


850 








FT 




/note= "Protein kinase C (PKC) 


site" 






FT 


Modif ied-site 


855 








FT 




/note= "Protein kinase C (PKC) 


site" 







FT Modified-site 863 

FT /note= "Casein kinase II site" 

FT Modified-site 868 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 893 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 912. .914 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 925. .927 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 954 

FT / n ote= "PKC and casein kinase II sites" 

FT Modified-site 956 

FT /note= "PKC and casein kinase II sites" 

FT Region 975. .1162 

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Region 976. .1163 

FT / n ote= "C-terminal common region found in Nogo A, B and C 

FT isoforms" 

FT Domain 988. .1023 

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region specifically 

FT described in claim 16" 

FT Modified-site 1024 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1071. .1073 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1073 

FT / no te= "Protein kinase C (PKC) site" 

FT Modified-site 1089 

FT /note= "Protein kinase C (PKC) site" 

FT Domain 1090. .1125 

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region specifically 

FT described in claim 16" 

FT Modified-site 1141. .1143 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1143 

FT /note- "Protein kinase C (PKC) site" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 

DR N-PSDB; AAD01173. 
XX 



PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 3; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a rat Nogo A protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The protein was 

CC derived from a cDNA generated by fusing R018U37-3, R1-3U21 cDNAs isolated 

CC from hexanucleotides-primed rat brain stem/spinal cord library, and 01118 

CC cDNA from an oligo d(T) -primed rat oligodendrocyte library. Nogo proteins 

CC and fragments displaying neurite growth inhibitory activity are used in 

CC the treatment of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer ! s and Parkinson ! s diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperprolif erative or benign 

CC dysprolif erative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: The present sequence 

CC designated as SEQ ID NO: 2 is stated to be the same as the sequence shown 

CC in Fig. 13 (see AAY71384) of the specification. However, this sequence 

CC does not match the sequence given in Fig. 13. SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 

Query Match 72.5%; Score 4296.5; DB 3; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 6.2e-214; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Mhlll I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I ! : I I :: I M I I I I I I I I I ! 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 P AAGL S AAP VPT APAAGAP LMD FGND FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP 118 

I I I I I I I I II III I I I : I I : I I I I I II I I I I I I I i I I I I I I I I : II : I 
Db 61 PAAGL S AAAVP — P AAAAP LLDFSSDSVP PAP RG P L P AAP P AAP E RQ P S W E R S P AA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKR 178 

III I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE P AAP P S T P AAP KR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Ml : I I I I I II I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 6 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

I I I I I I II I I I I I I I I : I I I : : I : : I I I I I : I : | : : I I I I I I I I II I 



Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : II III Ml 
Db 2 87 EMGS S FKGS PKGESAI LVENTKEEVI VRS KDKED-LVCSAALHS PQES PVG KEDR 34 0 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 4 03 

III It I I I I : : : I I I : I I M II I I II I : M I II : I I I : I I I : I : I 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Ml:: Mill: MM I : I I I I I II I : I I I M I II I I Mill 
Db 397 SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 4 64 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I I I I I I I I I II I I II II I I M I M M I I II I I I I I I II II II I I I I II Ml 

Db 4 56 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

II I I M I I I I I I I I I 1 II I I I II I I I I I II II I II I M I I I I II : I I II I I I I I I 

Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

II M I : II I II II I I I I I II I I II I M I I I M I M II I I I I I |:|:||| Mill 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 7 02 

II M I I I : I M I I M IIMM I I I : I M I II I II I II I I I I II I II I I : I II I : 
Db 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

IIMM I : I M M M M I I I I : I II I I I I II II I I : I I I Ml I I M II I I II I I 
Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : : =1 = 111 I I I II I II I I : : I MM : : : II : I I I II Mill : I 

Db 754 TVAQHK-EERLSAS PQELGKPYLES FQPNLHSTKDA-ASNDI PTLTKKEKI SLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

M II I I I I III : I : I : I II I I I I II I I I II I M MM II I I I : I II I II I I I 
Db 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

Qy 8 83 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

I I I I I M I I I I I II I II II I I I : : : I I : I I : I I : || : I : I II 
Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I I : I I I I I I I II I I I I II II II II II IM MM II II II I I I II I I I I 
Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

I I I I M M I M I I I I I I M I I I II I I II I I I I I I I II I II I II I || I I I II II I I II I I I 
Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNS7\IiGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTWGALFN 1122 

I M I II I I II I I I I II II I II I I I I I I I II I II I I I I I I II II I I II I II I I I I 

Db 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 



Qy 1123 GLTLLIIALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I II I I I I : I I I I I I I I I I I I II I I I I I : I I I I I I I It I I I II I I II I : 
Db 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGIANKSVKDAMAKIQAKIPGLKRKAD 1163 



RESULT 13 
ABB81074 

ID ABB81074 standard; protein; 1163 AA. 
XX 

AC ABB81074; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective ; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; receptor. 
XX 

OS Rattus norvegicus. 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-00893348 . 
XX 

PR 19-MAY-1998 



PR 21-JUL-1998 
PR 22-DEC-1998 
PR 19-MAY-1999 



98IL-00124500. 
98WO-US014715. 
98US-00218277. 
99US-00314161. 



XX 

PA (YEDA ) YE DA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides . 

XX 

PS Example 5; Page 44-47; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 



CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creut z feldt— Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates ) , Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the rat 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 1163 AA; 



Query Match 72.5%; Score 4296.5; DB 5; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 6.2e-214; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

M I • I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I : I I :: I II M I I I I I II I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 P AAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ P S W D P S P VS S T VP 118 

I I I I I I I I II III I I I : II : I | | | | | | | | | | || | | | | | | | | || : | | : | 
Db 61 PAAGLSAAAVP—PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 17 8 

Ml I I I I I I I I I I I I I I || I I I I || | | | | | | I I I I I I I I I I I I I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAP P S T PAAP KR 166 



Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Ml : I I I I I I I M I I : I : I II I I I I I I I I I I I I I II 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYS 284 

I I I I M I I I I I I I I I I : | | | : : | : : | | | | | : I : I :: I I I I I I I I II I 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 



QY 285 EMGSSFSVSPKAESAV1VANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

IN' I I I I I I : : I I : I I : I I : : I I : I : I I : II III II I 

Db 287 EMGSSFKGSPKGESAI LVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 4 03 

I I I M I Ml : : : I I I : I I I I I I I I I I I : III,: | | | : | | | : | : | 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 



Qy 4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 4 63 

Nihil Mill: I I I I I : I I I I I I I I : I I I I I II I I I : I I | | 
Db 397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I I I I I I I II I I I I I I I I I I : II I I : I I I II I I I I I I II I I I I MINI I : I 
Db 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

Ml I : I I I I I I M I I I I I I II I I I II I I I I I I I I I I : I I || I || : I I I I I I I I I I 
Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

I I I I I : M I I I II I I I I I I I I I I I I : II I I II I : I I I I I I I I I : I : I I I Mill 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

I I I M I I : I : I I I II Mill: I I I : I I I I II M II I II II II I I M I I : I I I I : 
Db 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

I M : I I M I I : I : I II I II I I : M I I I I I I I II II : II I I : I I I I : I II I I I I I 
Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : : =1 = 111 I I I II I II I I : = I MM : : : I I M I II I I I I I I : I 

Db 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

I M I I I II III M M M I I I I I II I II I I I I I I MM II I II M II I I I I I I 

Db 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

Qy 883 SEI7\NAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

II I II II I II I I I I I I I II I II : : : I I M I M I : II : I M I I 

Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I I : I I M I I II I I I II I I I I I I I II I II: MM I I I II | | | | | || I M 
Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

M II II II M I I II II I II I I I II I II II I I I II I I II I I I I I I M I II II II I I I I I II 
Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNS7VLGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

I M I I I M I I I M M II II I II I I I I I II II II I II I I I I II II I I II I I II I II I I II 
Db 104 8 LESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFN 1107 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I M II II I II II II M I I II I M I I I II I I II I I M II I I I II II II I II II I I : 
Db 1108 GLTLLIIALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

RESULT 14 
AAY71557 

ID AAY71557 standard; protein; 1162 AA. 



XX 

AC AAY71557; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A truncated protein used in the construction of mutant Nogo-A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US02 6160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblas toma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprolif erative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a truncated form of rat Nogo A protein shown in AAY71310, which is 

CC used in the construction of mutant Nogo-A. Nogo-A is composed of His- 

CC tag/T7-tag/vector/Nogo-A sequence aa 1-1162. Nogo A deletion mutants were 



CC used for mapping the inhibitory sites of Nogo protein. Major inhibitory 

CC region was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification, 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 
XX 

SQ Sequence 1162 AA; 

Query Match 72.5%; Score 4294.5; DB 3; Length 1162; 

Best Local Similarity 74.1%; Pred. No. 7.9e-214; 

Matches 885; Conservative 103; Mismatches 156; Indels 51; Gaps 19 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

I M : I II I I II I I I I I I I I I I I I II I I I I I II I I I I I : I I :: I I II I I I I I I I I I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

I I M I I I I II III I II : I I : I I I I I I I I I II I I I I I I I I I I I I : I I : I 
Db 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKR 178 

Ml I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

I M : I I I M I I I I I I : I : I I I I I I I II I I I I I I I I I 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

I I I I I II I I II I I I I I : I I I : : ! : : | | | | | : | : | :: I I I I I I I I I I I 
Db 22 7 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 2 86 

Qy 285 EMGS S FS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDE 344 

I I I I I I I II I I I :: I I : I I : I I :: I I : I : I I : II II I III 
Db 287 EMGS S FKGS P KGE SAI LVENTKEEVI VRS KDKED- LVC S AALHS PQES P VG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : M I : I : I 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 



Db 



i i i i * • i - i i i i i*i i ii i i ii • i i i i i i i i I i - i i i i 

397 SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 



QY 



464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 



' i f t i i i ii I I I I • I I I I • I I I t I I I I I I I II I I I I Mill) I - I 

456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 




Db 



Qy 



524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 



Db 




Qy 



584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 



Db 


575 


PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 


634 


Qy 


643 


PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 
M 1 1 1 1 1 : 1 : 1 1 1 II Mill: 1 1 I : 1 II 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 : 1 I I 1 : 
PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 


702 


Db 


635 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 
M 1 : 1 1 1 : I 1 : 1 : 1 1 1 1 1 1 i 1 : 1 1 i 1 II 1 1 1 1 1 1 1 : 1 1 I 1 : 1 1 II : 1 1 1 1 1 1 1 1 
YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 


762 


Db 


695 


753 


Qy 


763 


SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 
: : : : : : 1 : 1 1 1 1 1 1 II II M 1 : : 1 : 1 1 1 : : : 1 1 : 1 1 II 1 1 1 1 1 1 : 1 
TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 


822 


Db 


754 


811 


Qy 

Db 


823 
812 


AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 
1 : 1 1 1 1 1 1 IN : 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 : I : | i 1 1 1 1 : 1 1 1 1 1 1 I | i 
AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 


882 
870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 
1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : : II : I I : | | : | | : | : | | | 
SEIANIQSGADSLPCLELPCDLSFKNIYPK--DEVHVSDEFSENRSSVSKASISPSNVSA 


942 


Db 


871 


928 


Qy 


943 


LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFS7U)LGKTSWDLLYWRDIKK 
1 1 1 : 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II: 1 1 : 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 
L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


1002 


Db 


929 


987 


Qy 

Db 


1003 
988 


TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 
1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I I I I I | | | | | | | | | | || | | || | | | | | | | | | 
T GWFGAS L FLLLS LTVFS I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGH P FRAY 


1062 
1047 


Qy 

Db 


1063 
1048 


LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTWGALFN 
1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | || | | | | | | | M || | | | | || | | | | 
LES EVAI S EELVQKYSN S ALGHVNS T I KELRRL FLVDDLVD S LKFAVLMWVFT YVGALFN 


1122 
1107 


Qy 

Db 


1123 
1108 


GLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKA 1177 
M 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 I I I I I I I I | | | 
GLTLLI LALI SLFS I PVIYERHQVQ I DHYLGIANKSVKDAMAKIQ7VKI PGLKRKA 1162 





RESULT 15 
AAY71384 

ID AAY71384 standard; protein; 1163 AA. 
XX 

AC AAY71384; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Alternative version of rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 



XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 1. .171 

FT /note- "Inhibits NIH 3T3 fibroblast spreading" 

FT Modified-site 30 

FT /note= "Casein kinase II site" 

FT Region 31. .58 

FT /note- "Acidic region" 

FT Region 172. .259 

FT /note- "This region is not essential for inhibitory 

FT activity" 

FT Misc-dif ference 223 

FT /label= Unknown 

FT /note= "There is Leu at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 233 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 242. .244 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 291 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 295 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-dif ference 404 

FT /note= "There is lie at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 436 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 468. .470 

FT /note= "Asn is N-glycosylated" 

FT Misc-dif ference 469 

FT /label= Unknown 

FT /note= "There is Lys at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 484 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 488 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 502 

FT /note- "Casein kinase II site" 

FT Inhibitory-site 542. .722 

FT Modified-site 576 

FT /note- "Casein kinase II site" 

FT Peptide 623. .640 

FT /note= "used as immunogen to generate antibody AS 472" 

FT Modified-site 626 

FT /note- "Protein kinase C (PKC) site" 

FT Misc-dif ference 661 

FT /note= "There is Asn at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 694. .696 

FT /note- "Asn is N-glycosylated" 

FT Modified-site 715 

FT /note- "Casein kinase II site" 

FT Peptide 762. .1163 



FT /note= "used as immunogen to generate antibody AS Bruna" 

FT Modified-site 784 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-dif ference 820 

FT /note= "There is Leu at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 821 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 850 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 855 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 863 

FT /note= "Casein kinase II site" 

FT Modified-site 868 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 893 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 912. .914 

FT /note- "Asn is N-glycosylated" 

FT Modified-site 925. .927 

FT /note- "Asn is N-glycosylated" 

FT Modified-site 954 

FT /note= "PKC and casein kinase II sites" 

FT Modified-site 956 

FT /note= "PKC and casein kinase II sites" 

FT Region 975. .1162 

FT /note- "This region is not essential for inhibitory 

FT activity" 

FT Region 976. .1163 

FT /note- "C-terminal common region found in Nogo A, B and C 

FT isoforms" 

FT Domain 988. .1023 

FT /label- Transmembrane_domain 

FT /note- "C-terminal hydrophobic region" 

FT Modified-site 1024 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 1071. .1073 

FT /note- "Asn is N-glycosylated" 

FT Modified-site 1073 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 108 9 

FT /note- "Protein kinase C (PKC) site" 

FT Domain 1090. .1125 

FT /label- Transmembrane_domain 

FT /note- "C-terminal hydrophobic region" 

FT Modified-site 1141. .1143 

FT /note- "Asn is N-glycosylated" 

FT Modified-site 114 3 

FT /note= "Protein kinase C (PKC) site" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 



PR 06-NOV-1998; 98US-010744 6P . 
XX 

PA (SCHW/) SCHWAB M E . 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 3; Fig 13; 122pp; English. 
XX 

CC The present sequence is an alternative version of rat Nogo A protein 

CC which is a potent neural cell growth inhibitor and is free of all central 

CC nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblas toma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprolif erative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. Note: The present 

CC sequence is an alternative version of the Nogo A sequence shown in Fig. 

CC 2A (see AAY71310) . SEQ ID numbers 35-42 are referred in claim 32 and SEQ 

CC ID NO: 29 in disclosure of the specification. However the specification 

CC does not include sequences for these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 



Query Match 72.4%; Score 4286.5; DB 3; Length 1163; 

Best Local Similarity 74.1%; Pred. No. 2.1e-213; 

Matches 886; Conservative 104; Mismatches 155; Indels 51; Gaps 19 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I II I I I I II I I I I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 PAAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RGP L PAAP PVAP E RQ P S WD P S P VS S T VP 118 

I I I II I I I II III II I : II : I I I I I I I I I II I I I I I I I I I I I I : II ■ I 
Db 61 PAAGLSAAAVP— PAAAAPLLDFSSDSVP PAP RGPL PAAP PAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 17 8 

I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I || i I M I I I I I I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 



Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

I I I : I I I M M I I 1 I : I : M I I I I I I I I I I I I I I I I I 



UD 


lb / 


RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASXPSL 


226 


Qy 


O O R 


SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 
1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 I I : : | : : | | | | | : | : | ::,',! I I I 1 1 1 
SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 


284 


nL 
UD 


O O *~7 

22 / 


286 


Qy 


o o c 

285 


EMGS S FSVS PKAESAVIVANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDE 
1 1 M 1 1 1 1 1 II 1 : : 1 1 : 1 1 : 1 1 : : 1 1 : 1 : 1 1 : II III III 
EMGS S FKGS PKGESAI LVENTKEEVIVRSKDKED-LVCSAALHS PQES PVG KEDR 


344 


DD 


O O 1 

287 


340 


Qy 


345 


WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DML7\AGGKIESNLE 
1 1 1 1 1 1 1 1 1 ::: 1 M : 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 I I : I | | : | | | : | : | 
WSPEKTMDIFNEMQMSWAPVREEY7VDFKPFEQAWEVKDTYEGSRDVLAA RANVE 


403 


Db 


341 


396 


Qy 


4 04 


SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPT^TESIAT 
1 1 1 1 : M Mill : II 1 1 1 : I II 1 II 1 1 M 1 1 II II 1 I I Mill 
SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 


463 


UD 


397 


455 


Qy 


464 


NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 

1 M 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 II M 1 1 II 1 1 1 II 1 II II 1 1 1 M 1 1 1 1 M 

NTFPLLEDHTSENXTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 


523 




4bo 


514 


Qy 

Db 


524 
515 


VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 

1 1 1 1 M II 1 II II 1 II II II II 1 II 1 1 1 1 II 1 1 II 1 M II II M M 1 II 1 1 MM 

VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 


583 
574 


Qy 

Db 


584 
575 


PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 

1 1 1 1 1 M II 1 1 1 II II 1 1 II 1 1 M 1 M II 1 1 II M II Mill 1 M M II II 1 II 

PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 


642 
634 


Qy 

UD 


643 
635 


PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 

1 1 1 1 1 1 1 M M 1 1 II Mill: 1 1 M 1 1 II 1 II II 1 1 II I II II M II M II 1 : 
PPYEE1AMNVALKALGTKEGIKEPESFMAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 


702 
694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 
1 1 1 M 1 1 : 1 1 M M II II 1 II M 1 1 1 II 1 1 II 1 1 1 M II 1 : 1 II 1 M II 1 II 1 1 
YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 


762 


Db 


695 


753 


Qy 


763 


SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 
: : : : : M M II 1 1 II II II M : : 1 MM : : : M : 1 M II 1 II II M 
TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 


822 


Db 


754 


811 


Qy 

Db 


823 
812 


AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 

1 M M II 1 III M M M II 1 II M 1 1 II 1 II II MM II 1 II M II II 1 1 1 1 

AIYSNDDLSSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 


882 
870 


Qy 


8 83 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 
1 1 II 1 1 1 M 1 1 II 1 II 1 M 1 M : : : II M 1 M 1 : 1 1 : 1 M II 
SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 


942 


Db 


871 


928 


Qy 


943 


LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 
1 1 M MM 1 1 1 1 M 1 1 II 1 II 1 M 1 II II: MM I II 1 II 1 1 II 1 1 1 II 
L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


1002 


Db 


929 


987 


Qy 

Db 


1003 
988 


TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 
1 1 M 1 1 II 1 II 1 1 1 1 II 1 II || M II 1 1 II II II II 1 1 1 II 1 II 1 1 II 1 II II II 1 1 1 1 1 
T GWFGAS LFLLLS LTVFS I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGH P FRAY 


1062 
1047 



Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVIjytm/FTYVGALFN 1122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I 
Db 1048 LESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLIIALISLFSVPVIYERHQAQIDHYLGLANKWKDAMAKIQAKIPGLKRKAE 1178 

MMIIIIIIIMMIIMIIII Mill :| MM: 

Db 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 



Search completed: September 3, 2004, 16:05:30 
Job time : 85.2611 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



September 3, 2004, 16:02:30 ; Search time 23.2562 Seconds 

(without alignments) 
2615.013 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 



389414 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/ 6A_C0MB . pep : * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: + 

5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:* 

6 : /cgn2_6/ptodata/2/iaa/backf ilesl .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

Sequence JL, Application US/08700607 
Patent ]!io^_bQ5810S_J^ 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/700, 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 199 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY:. 

CLONE: Consensus 
US-08-700-607-1 

Query Match 15.7%; Score 931; DB 2; Length 199; 

Best Local Similarity 99.0%; Pred. No. 8e-50; 

Matches 189; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIAIALLSVTISFRIYKGVI 1047 

I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I 1 II I I I I I I I I I I I I I I I I I I 
Db 9 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68 

Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I II I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 128 

Qy 1108 AVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I M II I I I I I I I I II I I I I I I I 
Db 129 AVLMWFTWGTUjFNGLTLLIIAlLISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 188 

Qy 1168 AKIPGLKRKAE 1178 

I I I I I I I I I I I 
Db 189 AKIPGLKRKAE 199 



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 



TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

; STREET: 3174 Porter Drive 

; CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 

ZIP : 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ Version 1.5 

; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 08/700 , 607 

; FILING DATE: Filed Herewith 

ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-855-0555 

TELEFAX : 415-845-4166 
; INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 77 6 amino acids 
; TYPE: amino acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

; LIBRARY: GenBank 

CLONE: 307307 
US-08-700-607-5 



Query Match 13.3%; Score 787.5; DB 2; Length 776; 

Best Local Similarity 31.9%; Pred. No. 3.6e-40; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 588 ESEATPSPVLPDI — VMEAPLNSA VPSAGASVIQPSSSPLE ASSVNY 632 

I I I I I I I : : II: I : I : : I I : I : I 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

Qy 633 — ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

I : I I : : : I I : : I : I I : : I I I : : : I 

Db 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDISIKPEGVREPDK-PAPVEGKIIKDHLLEEST 259 



Qy 676 EAPYISIACDL 1 KET KL S AE PAP D F S D YS EMAKVEQ P VP DH S E LVE D 722 

I I I I II : I I II:: : : | : : : 

Db 2 60 FAPYID DLSEEQRRAPQITTPVKITLTEIEPSVE TTTQEKTPEKQDICLK 309 

Qy 723 SSPDSEPVDLFS DDS IPDVPQKQ DETVMLVKESLTETSF 761 

I II : I I III I : I M : : I I : 

Db 310 PSPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 



Qy 



7 62 ESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELS 821 



=111= =1 II I II :: : 

Db 364 -KGLSYETAENPRPVGQLADRP EVKARSGPPTI PSPLDHEA 4 03 

^ 822 TAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFP TLISSKTDSFS 868 

"M I II:: I I : I II 

Db 404 SSAESGD SEIELVSEDPMAAEDALPSGYVSFGHVGGPPPSP 444 

869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKI5FSD 921 

: I I : I : : I : : | : I I I III 

Db 44 $ ASPSIQYSILREEREAELDSELIIESCDASSAS -EESPKREQDSPPMKPSALD 496 

QY 922 DF SKNGSATSKVLL LPPDVSALGHTQAEIESIVKP 95 6 

1=11 I III II I I : : I 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

QY 957 KVLEKEAEKKLPSDTEKEDRSPSA IFSADLGKTSWDLLYWRDIK 1001 

= I = ::M:| :| I | :|||MIMI 
Db 552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAI DLLYWRDI K 599 

Qy 1002 KTGWFGASLFLLLSLTVFSIVSVTAYI7VLALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1061 

= N : I I I : I II III II : I I I I I : I I I I I I I I I | | | I : I I : || : | || I I I : I 
Db 600 QTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKA 659 

Qy 1062 YLES EVAI S EELVQKYSNS ALGHVMCTI KELRRLFLVDDLVD S LKFAVLMWVFT YVGAL F 1121 

111 I : : I : I =IN = = =M 1 = 111111111 IN HIM: | | | | | || 

Db 660 YLELEITLSQEQIQKYTDCLQFYWSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 

Qy 1122 NGLTLLI LALI SLFSVPVI YERHQAQI DH YLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I I I I : : I :: I : I : : | | : | : I I I I I 1 | | | | : : : I I I I I I I I | | | | | 
Db 72 0 NGLTLLIJy[AWSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKR^^AE 776 



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/7 00 , 607 



FILING DATE: Filed Herewith 
ATTORNEY/ AGENT INFORMATION: 
NAME : Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 356 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307309 
US-08-700-607-6 

Query Match 11.9%; Score 705; DB 2; Length 356; 

Best Local Similarity 42,6%; Pred. No. 1.4e-35; 

Matches 168; Conservative 55; Mismatches 86; Indels 85; Gaps 11; 

QY 803 EVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISS 862 

: I I : : : I I : : I I I : I : || III:: I : : : 

Db 30 QYSILREEREAELDSELI 1 ES CDAS SAS EES PKRE QDSPPMK PSALDA 77 

QY 8 63 KTDSFSKLAREYTDLEVSHKSEIANAPDGAGSL PCTELPHDLSLKNIQPKVEEKISF 919 

III: : : II III III 
Db 78 IREETGVRAEERAPSRRGLAEPGSFLDYPSTE PQPGPE 115 

Qy 920 SDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPS 979 

III II I i : : I : | : :: | | : 

Db 116 LPPGDGAL EPETPMLP RKPEEDSSSNQSPA 145 

Qy 980 A 1 FSADLGKTS WDLLYWRDI KKTGWFGASLFLLLSLTVFSI VS 1024 

I : I M : I M I I I I I I : I I : I I I : I I I I I I I I : I I 
Db 146 ATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWS 202 

Qy 1025 VTAYIAIALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGH 1084 

I I I : I I I II I I I I I I I I I : II : I I : I I II I I : I I I I I : : I : I : I I I : : : 
Db 203 WAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFY 2 62 

Qy 1085 VNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERH 1144 

M I : I I I I II I I I I I I I I I I I I I I I I : I I II I I I I I I I I I : : | : : | : | : : M : | : | 
Db 263 VNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAWSMFTLPVVYVKH 322 

Qy 1145 QAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I :: : I I I I I II I I I I 
Db 323 QAQIDQYLGLVRTHINAWAKIQAKI PGAKRHAE 356 



RESULT 4 
US-08-700-607-7 

; Sequence 7, Application US/08700607 
; Patent No. 5858708 



GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY : U.S. 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 ll 00 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-8 45-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 208 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307311 
US-08-700-607-7 



Query Match 11.6%; Score 688; DB 2; Length 208; 

Best Local Similarity 67.5%; Pred. No. 6.9e-35; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 

Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

I ' : I I I I I I I I I : I I : I I I : I II III II : I I I I I : I I I I I I I I I I I I I I : 
Do 18 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 77 

Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

M:||: Mill I: :|:| :|||:: :|| I : I I I f I I I I I MINIM 

Db 7 8 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137 

Qy 1108 AVLMWFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

Mill: I I I I I I I I I I I I I : : I : : I : I : : I I : I : | I | II I I II I : : Mill 
Db 138 AVLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQ 197 



Qy 1168 AKIPGLKRKAE 1178 

I I I II II II 
Db 198 AKI PGAKRHAE 208 



RESULT 5 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, 01 ga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700, 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-84 5-4166 
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 267 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 

Query Match 11.3%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. l.le-33; 

Matches 124; Conservative 34; Mismatches 29; Indels 0; Gaps 0 

Qy 988 KTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 104 

I : : I I I I I I I I I : M : II I : I II III I I : I I I I I : I I I i , I I I I I I I I I : 
9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 68 



Qy 



1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 



' • * ' • i • i i i • • ill 1*111111111 I I I I | I ( I 

69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYWSTLKELRRLFLVQDLVDSLKF 12 8 




Db 



Qy 




Db 



Qy 



1168 AKIPGLK 1174 



Db 



189 AKIPGAR 195 



RESULT 6 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 18 6 Human Secreted proteins 

; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/ 09/ 14 9 , 47 6 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/ 04 4 93 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,626 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,334 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/04 0,336 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,163 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/047,600 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,615 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,597 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,502 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,633 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,583 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,617 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,618 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,503 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 



; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,578 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/043,576 

EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/047,501 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,670 

EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/056,632 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,664 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,876 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,881 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,909 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,875 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,862 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,887 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,908 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/048,964 

; EARLIER FILING DATE: 1997-06-06 

; EARLIER APPLICATION NUMBER: 60/057,650 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/056,884 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/057,669 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

; EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 9.1%; Score 541.5; DB 4; Length 168; 

Best Local Similarity 60.5%; Pred. No. 5e-26; 

Matches 101; Conservative 34; Mismatches 31; Inciels 1; Gaps 1; 
Qy 1013 LLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 1072 



' 1 " ' • n i i ii ii ii ii ii i i i l • l l I • I I M I • I I ! • : : • I I 

1 MLLSLAAFSVISWSYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSE 60 





Db 



Qy 



1073 LVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 1132 



Db 




Qy 1133 SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAi(IPGL-KRKAE 1178 

IMII:|:| I : | | | || : | | : | : | | | 

Db 121 LIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 167 



RESULT 7 
US-08-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT : Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo .Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/7 00,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-3 



Query Match 8.7%; Score 513; DB 2; Length 241; 

Best Local Similarity 47.7%; Pred. No. 4.7e-24; 

Matches 102; Conservative 37; Mismatches 55; Indels 20; Gaps 1; 

Qy 963 AEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI 1022 

II I I : : : I I I : I I I : I I I I I I I : I : I I I I II: 



Db 



20 AEPSAPGGGGSPGACPALGTKSCSSSCAVHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSV 79 



Qy 1023 VSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSAL 1082 

; ll : I : I I I M I I I I I I I I I II I I : I I I : I I I | | : | | | : :: :| | | 
Db 80 ISW SYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 139 

Qy 1083 GHVNCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSVPVIYE 1142 

I 5 ' ; l : M I I I r | | | | J | | || ||: |||||:|||:|||||| : : | M:: | 
Db 140 VHINRALKLIIRLFLVEDLVDSLKl^VFMWLMTYVGAVFNGITLLILAELLIXSVPIVYX 199 

Qy 1143 RHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRK 117 6 

: : I : : I I : : I 

Db 200 KY KVPSKTPWNRQK 213 



RESULT 8 

US-09-621-976-4600 

; Sequence 4600, Application US/09621976 
; Patent No. 6639063 
; GENERAL INFORMATION: 

; APPLICANT: Dumas Milne Edwards, J.B. 

; APPLICANT: Jobert, S. 

; APPLICANT: Giordano, J.Y. 

; TITLE OF INVENTION: ESTs and Encoded Human Proteins. 

FILE REFERENCE: GENSET . 054PR2 
; CURRENT APPLICATION NUMBER: US/09/621, 976 
; CURRENT FILING DATE: 2000-07-21 
; NUMBER OF SEQ ID NOS : 19335 
; SOFTWARE: Patent. pm 
; SEQ ID NO 4600 

LENGTH: 75 

TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 
; NAME/KEY: UNSURE 
LOCATION: 58 

OTHER INFORMATION: Xaa = His, Pro 
NAME/ KEY: UNSURE 
LOCATION: 28 

OTHER INFORMATION: Xaa = Met,Val 
NAME/ KEY: UNSURE 
; LOCATION: 19 

OTHER INFORMATION: Xaa = Pro, Gin 
NAME/KEY: UNSURE 
LOCATION: 53 

OTHER INFORMATION: Xaa = Ser,Tyr 
US-09-621-976-4600 



Query Match 5.8%; Score 344.5; DB 4; Length 75; 

Best Local Similarity 82.6%; Pred. No. 1.9e-14; 

Matches 71; Conservative 0; Mismatches 4; Indels 11; Gaps 1; 

1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSSDSPPRXQPAFKYQFXREPEDEE EDLEELEVLERKPA 4 9 

Qy 61 AGL S AAP VPT APAAGAP LMD FGND FV 8 6 



Ill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 50 AGLXAAPVXTAPAAGAPLMDFGNDFV 75 



RESULT 9 

US-09-621-976-4601 

Sequence 4601, Application US/09621976 
Patent No. 6639063 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 



Dumas Milne Edwards, J.B. 
Jobert, S . 
Giordano, J.Y. 
TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET. 054PR2 
CURRENT APPLICATION NUMBER: US/ 09/621, 976 
CURRENT FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 19335 
SOFTWARE: Patent. pm 
SEQ ID NO 4601 
LENGTH: 75 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/ KEY: UNSURE 
LOCATION: 58 

OTHER INFORMATION: Xaa = His, Pro 
NAME/ KEY: UNSURE 
LOCATION: 28 

OTHER INFORMATION: Xaa = Met,Val 
NAME/KEY: UNSURE 
LOCATION: 19 

OTHER INFORMATION: Xaa = Pro, Gin 
NAME/ KEY: UNSURE 
LOCATION: 53 

OTHER INFORMATION: Xaa = Ser,Tyr 
US-09-621-976-4601 

Query Match 5.8%; Score 344.5; DB 4; Length 75; 

Best Local Similarity 82.6%; Pred. No. 1.9e-14; 

Matches 71; Conservative 0; Mismatches 4; Indels 11; Gaps ] 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSSDSPPRXQPAFKYQFXREPEDEE EDLEELEVLERKPA 49 

Qy 61 AGL S AAP VP T AP AAGAP LMD FGND FV 86 

III I I I I I I I I I I I I I I I I I I I I I 
Db 50 AGLXAAPVXTAPAAGAPLMDFGNDFV 75 



RESULT 10 
US-08-714-741-32 

Sequence 32, Application US/08714741 
Patent No. 6500613 
GENERAL INFORMATION: 

APPLICANT: Briles, David E. 
APPLICANT: McDaniel, Larry S. 



APPLICANT: Swiatlo, Edwin 
APPLICANT: Yother, Janet 
APPLICANT: Crain, Marilyn J. 
APPLICANT: Hollingshead, Susan 
APPLICANT: Tart, Rebecca 
APPLICANT: Brooks-Walter, Alexis 

TITLE OF INVENTION: PNEUMOCOCCAL GENES, PORTIONS THEREOF, 

TITLE OF INVENTION: EXPRESSION PRODUCTS THEREFROM, AND USES OF SUCH GENES, 
TITLE OF INVENTION: PORTIONS AND PRODUCTS 
NUMBER OF SEQUENCES: 47 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Curtis, Morris & Safford, P.C. 
STREET: 530 Fifth Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S. 
ZIP: 10036 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/714 , 741 
FILING DATE: 16-SEP-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Frommer Esq., William S. 
REGISTRATION NUMBER: 25,506 
REFERENCE/ DOCKET NUMBER: 454312-2460 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 840-3333 
TELEFAX: (212) 840-0712 
INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 8 991 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: amino acid 
US-08-714-741-32 



Query Match 5.3%; Score 316; DB 4; Length 8991; 

Best Local Similarity 21.4%; Pred. No. 9.9e-10; 

Matches 248; Conservative 138; Mismatches 479; Indels 296; Gaps 52; 

Qy 6 QSPLVSSSDSP-PRPQPAFKYQF-VREPEDEEEEEEEEEEDEDEDLEE — LEVLERKPAA 61 

: : I : I I : I I I I : : | : I : : : : : I I I | | : : : : | 

Db 7193 EAPAEQPKPAPAPQPAPAPKPEKPAEQPKPEKTDDQQAEEDYARRSEEEYNRLTQQQPPK 7252 

Qy 62 GLSAAPVP TAPAAG APLMDF 81 

M I III II 

Db 7253 AEKPAPAPKTGWKQENGMWYFYNTDGSMGEQAGQYRAAAEGDLAAKQAELEKTEADLKKA 7312 

QY B2 GNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPED DE 138 

I = IN I I I I I I I 1:1 I I I I | : : : I : I 

Db 7313 VNEPEKPAPAPETPA--PEAPAEQPK — P AP AP Q P AP AP K P E K P AEQ P KAE KT DDQQAE E 7368 



Q y 139 PPAR PPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXX 189 

I I I I : I I Mill I I : | : 

Db 7369 DYARRSEEEYNRLTQQQPPKAEKPAPAPKPEQPAPAPKNSKGEQAEQYRSAAGGDLAAKQ 742 8 

190 XKI MDLKEQPGNTISAGQEDFPSVLLET-AASXPSLSPLSAASFKEHEYLGNLSTV 244 

: : Ml: : : : I : | | | | : | | : : 
Db 742 9 VELEKTEADLKK AVNE PEKPAPAPET PAP EAPAEQPKPAPAPQ 7471 

2V 245 LPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVAN 304 

I I = I I : I I : | | | | | : : : | | | | 

Db 7472 ~PAPAPKPEKPAEQPK~-AEKPADQQAEEDYDRRSEEEYNRL — TQQQPPKAEKPAPAPQ 7526 

305 PREEI 1 VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS S EKAKDS FNE 357 

I : : II:: IN I I I : I : | : | 

Db 7527 PEQPAPAPKSLKEIDESDSEDYVKEGFRAPLQSELDAKQAKLSKLEEL SDKIDE 7580 

358 KRVAVEAPMREEYADFKPFERWEVKDSKEDSDMIAAGGKIESNLESKVDKKCFADSLEQ 417 
: I : : : M I II I I : : | : | | | | | 

Db 7581 LDAEI-AKLEKDVEDFK XSDGEQAGQYLAAAEEDLIAKKA ELEQ 7623 

Qy 418 TNHEKDSESSNDDTSFPS — TPEGIKDRSGAYITCAPFNPAATESIATNI FPLLEDPTS 474 

I I I : : :: I : | | I I I I I | 
Db 7624 T — EADLKKAVNEPGKPAPAPAPE TPAPEAPAEQPK PAP 7660 

Qy 47 ^ ENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYV--TTDNLTKVTEEWA-- 530 

I II' I M I I : I I I : : : : I : : I 

Db 7661 ETPAPAPKPEKPAEQPKPEK PADQQAEEDYARRSEEEYNRLTQQQPAPA 7709 

Qy 531 NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II : I : : I : : : : | I | I I : : : | 

Db 7710 QKPEQPAKPEKPAEEPTQPEKDAEIAKLE-KNVEYFKKTDAEQTEQYLAAAEKDLADKKA 7768 

^ 585 SFEESEA TPSPV-LPDIVMEAPLNSAVPSAGASVIQPSSSPLEAS 628 

I I I | | : | | : I | : I I : | : : : 

Db 7769 ELEKTEADLKKAVNEPEKPAEET PAPAPKPEQPAEQPKPAPAPQP-APAPKPEKTDDQQA 7827 

Qy 629 SVNYESIKHEPENPPPYEEAMSVSLKVSGIKEE IKEPENINAALQETEAPYI SIAC 684 

: I III:: II : | | | I I : I 

Db 7828 EEDYARRSEEEYNRLPQQQPPKAEKPAPAPKPEQPVPAEXPEN PAPAPKPAXAP 78 81 

QY 685 DLIKETKLSAEPAPDFSDYSEMAKVEQPV PDHSELVEDSS PDS 727 

: I : : : I I : I hi I I : I I : | 

Db 7882 QPLKPEEPAEQPKPE KPEEPAGQPEPEKPDDQQAGEDYARRSGGEYNRFPQQ 7933 

Qy 728 EPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGG 781 

: I I I :: | | | |||: : 

Db 7934 QPPKAEKPAPAP KPEQPVPAPKTLL KKAKLAGAKSKAATKKAEL 7977 

QY 7 82 KPYLESFKLSLDNTKDTLLP DEV STLSKK-EKIPLQMEELSTAVYSNDD 82 9 

M II : hi Ml II: : I : I I I : I I : II : :| 

Db 7978 EPELEKAEAELENLLSTLDPEGKTQDELDKEAAEAELNKKVEALPNQVSELEEELSKLED 8037 

Qy 83 0 LFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE YTDLEVSHKSEI 885 

1 = 1: I : I : |::| | : | : | | 

Db 8038 NL— KDAETNNVEDY IKEGLEEAIATKQAELEKTPKELDAALNELGPDGDEEE 8088 



Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKI — SFSDDFSKNGSATSKVLLLPPDVSAL 943 

I: III: III: :|::: | 
Db 8089 TPPPEAPAEQPKPEKPAE-ETPAPAPKPEKSADQQAEEDYARRSEEEYNRL 8138 

QY 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRD I 1000 

M : II I M I : I :: III :: | | : 

Db 8 139 —TQQQ PPK AEKPAPAPAPKPEQPAPAPKSRGLATKKKLNLAEARIELLL 8186 

Qy 1001 KKTGV VFGASLFLLLS 1016 

Ml: Ml Ml 

Db 8187 KKLGLEPGLEKAGAGLGNLLS 8207 



RESULT 11 
US-09-976-594-726 

; Sequence 726, Application US/09976594 

; Patent No. 6673549 

; GENERAL INFORMATION: 

; APPLICANT: Furness, Michael 

; APPLICANT: Buchbinder, Jenny 

; TITLE OF INVENTION: GENES EXPRESSED IN C3A LIVER CELL CULTURES TREATED WITH 
STEROIDS 

; FILE REFERENCE: PA- 00 41 US 

; CURRENT APPLICATION NUMBER: US/ 09/ 97 6, 594 

; CURRENT FILING DATE: 2001-10-12 

; PRIOR APPLICATION NUMBER: 60/240,409 

; PRIOR FILING DATE: 2000-10-12 

; NUMBER OF SEQ ID NOS : 1143 

SOFTWARE: PERL Program 
; SEQ ID NO 726 

LENGTH: 24 68 
; TYPE: PRT 

ORGANISM: Homo sapiens 

FEATURE : 

NAME/KEY: misc_feature 

OTHER INFORMATION: Incyte ID No. 6673549 4295277CD1 
US-09-976-594-726 



Query Match 5.0%; Score 296.5; DB 4; Length 2468; 

Best Local Similarity 21.5%; Pred. No. 2.5e-09; 

Matches 264; Conservative 148; Mismatches 459; Indels 355; Gaps 59; 

EDLDQSPLVSSS-DSPPRPQPAFKYQ FVREP EDEE 35 

II:: I I : I II : : I : : : | | | | : 



I I I : I I I : I I II I : I II I I I 



: I I : I : I I : I I II I : I | | : 

-AQSPGREPA— SSIHDETLPGGSESEATA SDEENREDQPEEFTATSGYT 1120 

WTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQE D 2 09 

I I Ml :| : I I II 

IEISSEPTPMDEMSTP RDVMSDETNNEETESPSQEFVNITK 1164 



Qy 


2 


Db 


956 


Qy 


36 


Db 


1016 


Qy 


95 


Db 


1074 


Qy 


155 


Db 


1121 



Qy 210 FPSVLLETAASXPS-LSPLSAAS FKE HEYLGNLSTVLPTEGTLQENVSEAS 259 

= I I I I : : : I I : | | : | : | | : | : : | : : 

Db 1165 YESSLYSQEYSKPADVTPLN— GFSEGSKTDATDGKDYNASASTISPPSSMEEDKFSRSA 1222 

QY 260 KEVSEKAKTLLIDRDLTEF SELEYSEMGS SFS 291 

111111:1 I II : :| :|| 

Db 1223 LRDAYCSEV--KASTTLDIKDSISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFS 1280 

Qy 292 VSP KAESAVIVANPR — EEIIVKN — KDEEEKL VSNNILHX 328 

: : I I I : I : I : I : : : : I : : | | : : | 

Db 1281 LTPNEIKVSAEAEVAPVSPEVTQEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSP 1340 

Qy 329 QQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK-PFERVWEVK 383 

Ml: : I I I : I : : I I : M I I : I I : I 
Db 1341 TDEKSSHLPTEV — IEKPPAVPVSFEFSDAKDENERASVS PMDEPVPDSESPIEKVLSPL 1398 

Qy 384 DSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDR 443 

I : : : III: II I I I : : I I : 

Db 1399 RS PPLIGSESAYESFLSA — DDKASGRGAESPFEEKSGKQGSPDQVS PVSE 1447 

Qy 444 SGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKK IEEKK A 488 

: I I : | : | : | | I I I : : : : | : | 

Db 1448 MTSTSLYQDKQEGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPT 1503 

Qy 489 QI --VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWA — N 531 

M : : I I I : : I I : I I : : | : | | : 

Db 1504 QIDVSQFGSFKEDTKMSISEGTVSDKSATP-- VDEGVAEDTYSHMEGVASVSTASVATSS 1561 

Qy 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYP7VAQLCP 584 

II III : : I : I : : : : I I I I I : : | : : | | 

Db 1562 FPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMSIS 1620 

Qy 585 S FEE S EAT P S P VL P - DIVMEAP LN S AVP SAGAS VI 618 

II : : I I I : : I : I : I I I : 
Db 1621 PPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEQSLAMDFSRQSPDH PTVGAGVL 1677 

Qy 619 Q PSSSPLEASSVNYESIKHE — PENPPPY EEAMSVS-LKVSGIKEEIKE 664 

I: I : I: I: I II I :| I I :: I 

Db 1678 HITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTSSAHT 1737 

Qy 665 PENINAALQETE AP YISIACDLIKE TKLSAEP A 697 

I I : II I II ||:::: | | | : : 

Db 1738 PSQIASPLQEDTLSDVAPPRDMSLYASLTSEKVQSLEGEKLSPKSDISPLTPRESSPLYS 1797 

Qy 698 PDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 734 

I Ml: II: II IN I : I | 
Db 1798 PTFSDSTSAVK-EKTATCHS SSSP PI DAAS AE P YGFRAS VL FDTMQHHLALNR 1849 

Qy 735 DDSIPDVP QKQDETVMLVKESLTETS FESMI EYENKEKLSALPPE 779 

III: -11:11 I : I : I I : I I : 

Db 1850 DLSTPGLEKDSGGKTPGDFSYAYQKPEET TRSPDEEDYDYESYEKTTRTSDV 1901 

Qy 78 0 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

I I I I : : : I : I I : I I I I I : | | : 

Db 1902 GGY-YYEKIERTTKSPSDSGYSYE— TIGKTTKTP EDGDYSYE — IIEKTTRTP 1950 



Qy 



840 



ETETFSDSSPIEIIDEFPTLISSKTDS FSKLAREYTDLEVSHKSEIANAPDGAGSLP 8 96 

I : I I I I I I I : I I : : I : I I : 
EEGGYSYD ISEKTTSPPEVSGYSYEKTERSRRLLDDISNGYDDS- 19 94 



Db 



1951 



Qy 



897 



CTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE-IESIVK 955 

: i I : : I I I : : II III : I I I : 

-EDGGHTLGDPSYSYETTEKITSFPESEGYSYETSTKTTRTPDTSTYCYETAEKITRTPQ 2 053 



Db 



1995 



Qy 



956 



PKVLEKE AEKKLPSDTEKE 974 

I I I II I I : : : 

ASTYSYETSDLCYTAEKKSPSEARQD 2079 



Db 



2054 



RESULT 12 
US-09-149-476-411 

; Sequence 411, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 
; CURRENT FILING DATE: 1998-09-08 
; EARLIER APPLICATION NUMBER: PCT/US98/04493 
; EARLIER FILING DATE: 1998-03-06 
; EARLIER APPLICATION NUMBER: 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,502 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,617 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,618 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,503 
; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 



EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056, 882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 4.8%; Score 285; DB 4; Length 92; 

Best Local Similarity 57.1%; Pred. No. 1.2e-10; 

Matches 52; Conservative 20; Mismatches 19; Indels 0; Gaps 
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Db 



Qy 



1139 VI YERHQAQI DHYLGLANKNVKDAMAKI QAK 1169 



Db 



62 I VYEKYKTQI DHYVGI ARDQTKS IVEKI P S K 92 



RESULT 13 
US-08-973-462-8 

Sequence 8, Application US/ 089734 62B 
Patent No. 6191270 
GENERAL INFORMATION: 
APPLICANT: DRUILHE, PIERRE 
APPLICANT: DAUBERS I ES, PIERRE 

TITLE OF INVENTION: MALARIAL PRE-ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 
FILE REFERENCE: 0660-0125-0 PCT 
CURRENT APPLICATION NUMBER: US/08/973, 462B 
CURRENT FILING DATE: 1998-02-06 
EARLIER APPLICATION NUMBER: PCT/FR96/00894 
EARLIER FILING DATE: 1996-06-12 
EARLIER APPLICATION NUMBER: FR 95/07007 
EARLIER FILING DATE: 1995-06-13 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 8 
LENGTH: 178 6 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence : Polypeptide 
US-08-973-462-8 

Query Match 4.7%; Score 276.5; DB 3; Length 1786; 

Best Local Similarity 20.1%; Pred. No. 2.6e-08; 

Matches 232; Conservative 201; Mismatches 471; Indels 249; Gaps 50; 

Qy 3 0 EPEDEEEEEEEEEEDEDEDLEE L E VL E RK P AAG L S AAP V P T AP AAGAP LMD F GN 8 3 

I i I : I I I I : : I : : I I : I h = : = I I : : 
Db 228 EESVEENDEESVEENVEENVEENDDGSVASSVEES IAS SVDES I DS S I EENVAPTVE 284 

Q y 84 DFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPAR- 142 

: | I : III : I : : I III : I I 

Db 285 EIVAPS WESVAPSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAEN 338 

Qy 143 ppppppASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQ 198 

I I : I I : I M : : I I 

Db 339 VEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESVEENVEESVAENVEESVAENVEES 398 

Qy 199 PGNTI SAGQEDFPSVLLETAAS XPSLSPLSAASFKEHEYLGNLSTVLPT-EGTL 251 

: :| : |: | |:: : | : :| : :| |: I :: 

Db 399 VAENV EESVAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESV 4 55 

Qy 252 QENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVI VANP 305 

:||| I: I I:: :: : I I :| II: I :l M 

Db 456 EENVEESVAENVEESVAENVEESVAENVEESVAE-NVEESVAENVEESV7VENVEESVAEN 514 



Qy 



306 REEIIVKNKDE EEKLVSNNILHX QQELPTALTKLVKED EW 34 6 

|||: : I | : : | : : : : :::::: | : | I 

Db 515 VEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESVEENVEESVAENVEESVAENVEES 574 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYA DFKPFERVWEVKDSKEDSDMLAAGGKIESN 4 01 

:| ::| i : II II : I I I hi :l I 

Db 575 VAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEES VEEN 62 6 



Qy 402 LESKVDKKCFADSLEQTNHE — KDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATE 459 

: I I | : : : | : : I : : I : I : 111 : I hi 
Db 627 VEESV AENVE E S VAEN VEE S VAEN VE E I VAP T VE E I VAP T VE E I VA PSWE 677 

Qy 460 SIATNIFPLLEDPTSENXTDE--KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVT 517 

|:| I : I : I I : : : I I I : I I : II I I 

Db 678 SVA PSVEESVEENVEESVAENVEESVAENVEESVAENVEES VAEN VE E I VAP T 730 

Qy 518 TDNLTKVT-EEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTS--EVMQ 573 

: : | I I : I I : : I : I I : I : I : I : I : : I : I I : : 

Db 731 VEEIVAPTVEEIVAPSWESVAPS-VEESVEENVEESVAENVE-ESVAENVEESVAENVE 788 

Qy 574 ESLYPAAQ — LCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSS PLEAS SVN 631 

I I : I :: I I I I I I I I I I : : : : = : = I : 

Db 789 ESVAPTVEEIVAPSVEESVA-PS VEESVAENVATNLSDNLLSNLLGGIETEEIK 841 

Qy 632 YESIKHEPENPPP YEEAMSVSLKV-SGIKEEIKE PENINAALQETEA 677 

: I I : I I I I : I : I I I I I : I : I hi 

Db 842 -DSILNEIEEVKENVVTTILENVEETTAESVTTFSNILEEIQENTITNDTIEEKLEELHE 900 

Qy 678 PYISIA CDLIKETKLSAEPAPDFSDYSEMA— KVEQPVPDHSELVED 722 

: | | h h I I II : I I I : : I : h 

Db 901 NVLSAALENTQSEEEKKEVIDVIEEVK — EEVATTLI ETVEQAEEKSANT ITEI FENLEE 958 

Qy 723 SSPDS EPVDLFSDDSI PDVPQKQDETVMLVKESLTET 759 

: : : I | : : : : I I : I I I : III 

Db 959 NAVESNENVAENLEKLNETVFNTVLDKVEETVEISGESLENNEMDKAFFSEIFDNVKGIQ 1018 

Qy 760 SFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK 810 

| |: I : : : I h : : I : : : I I : I I : : : : I 

Db 1019 ENLLTGMFRSIETSIVIQSEEKVDL NENWSSILDNIENMKEGLL-NKLENISST 1072 

Qy 811 E KIPLQMEE LSTAVYSNDDLF 831 

| : | : : I :: :|:| 

Db 1073 EGVQETVTEHVEQNVYVDVDVPAMKDQFLGILNEAGGLKEMFFNLEDVFKSESDVITVEE 1132 

Qy 832 ISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIA 886 

: M : : I I : : hi II I : : I I 

Db 1133 IKDEPVQKEVE-KETVS I IEEMEENI VDVLEEEKEDLTDKMI DAVEES I EI S S DS KEETE 1191 

Qy 887 NAP D GAG — SLPCTELPHD LSLKNIQPK-VEEKISFSDDFSKNGSATSKVL 934 

: I || I : : I I 1 I :::::::: | I I I : : 

Db 1192 SIKDKEKDVSLWEEVQDNDMDESVEKVLELKNMEEELMKDAVEIND ITSKLI 1244 

Qy 935 LLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTE KEDRSPSAI FSADLGKT 989 

: | : I: : : I I I I I I h : hi I I 

Db 1245 EETQELNEVEAD LIKDMEKLKELEKALSEDSKEIIDAKDDTLEKVIEEEHDITT 1298 

Qy 990 SWDLLYWRDIKK 1002 

Db 12 99 T LDE WELKDVEE 1311 



RESULT 14 
US-09-345-473E-40 

; Sequence 40, Application US/09345473E 



Patent No. 6558903 
GENERAL INFORMATION: 
APPLICANT: Hodge, Martin 

TITLE OF INVENTION: No. 6558903el Kinases and Uses Thereof 
FILE REFERENCE: 35800/183781 

CURRENT APPLICATION NUMBER: US/ 09/345 , 4 7 3E 
CURRENT FILING DATE: 1999-06-30 
NUMBER OF SEQ ID NOS : 62 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 4 0 
LENGTH: 1601 
TYPE: PRT 

ORGANISM: C. elegans 
US-09-345-473E-40 

Query Match 4.5%; Score 267.5; DB 4; Length 1601; 

Best Local Similarity 20.5%; Pred. No. 8e-08; 

Matches 229; Conservative 134; Mismatches 376; Indels 377; Gaps 56; 

Qy 29 REPEDEEEEEEEEE EDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGA 7 6 

| | : | | III I I I II : 1 I I : I I I : I I I I 
Db 4 62 RAKEEEERIREEAEIKEELRLRAEAKEKEKERLEK-ERLEKKAAAAAAANPNPTP 515 

Qy 77 PLMDFGNDFVPPAPRGP LPAAPPVAPE RQPSWDPS-PVS — STVPAPSP 122 

: | | I I i II::: : I : II M : : : I I I I 

Db 516 IPPTPATPHSSAQQQPIPPPLSTQTSAEIQQSAQQPSVPVTMIANIPAMSP 566 

Qy 123 LSA AAVSPSK LPEDDEPPARPPPPPPASVSPQAEPVW TPPAPAP— AAPP 17 0 

|| : | | : : I : I | : I : I I I I I : I I I 

Db 567 T S AQ PQ P VL S PT S AAVP VP TTMI H VP KP S E I P VQNVAT T AAP VAANNVP P S PAP FKT EDI 626 

Qy 171 STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAG-QEDFPSVLLETAASXPSLSPLSA 229 

||: II: | ::: | I I I II : 

Db 627 QTPTLAQ NTVPRTISTDASGLVINTPASIASPSPAPS 663 

Qy 230 AS FKEHEYLGNLSTVLP TEG TLQENVSEASKEVSEKAKTLLIDRDLT 27 6 

| : : : | | I : I I I I I I I : : I : 

Db 664 AT DVASTTAPVTPAPTPTTTTDGGAAAASTTTENKEEKRKSNKRKWMEILGCDES 719 



Qy 



277 E FSELEYSEMGSSFSVSP KAESAVIVANPREEIIVK 312 

I : I : I : I | :: I I I : 

Db 720 RNFALVSCRLDTSHKSVTFQFAPGTDKPCTIATKLLAEDCLLKVHVHIVEAQLGEVIQLI 779 

Qy 313 NKDEEE KLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMRE 368 

||:: I I : : I I : I : : : I | | | : : : : 
Db 7 80 NSDGKKGVGTKLAT — VLDPNSTEPPTITAVMPKD SSAATASNTKPKIEI 827 

Qy 369 EYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCF7VDSLEQTNHEKDSESSN 428 

| : : | : : : :::: II I h M 

Db 828 EKTPPTRDASQE PNNVQVT NVRKVS Q E S N 856 

Qy 429 DDTSFPSTPEGIKDRSGAYITCAPFN PAAT ES 460 

: I I I I I I : I I I I I I 

Db 857 AE-SVQSIP RPGGIIVMSPTNQTDSAPPPTGAAAKPSRFQVTKSADPIATPI SSS 910 

Qy 461 IAT-NIFPLLEDPTSENXTDEKKIEEK-KAQIVTEKNTSTKTSNPFFVAAQDSETDYVTT 518 

1:1 : |:: I I I I I : ll::l I : h : : ' ' 



Db 911 ISTATV IPIVA-ATPTNITSEPVIVQPITAQVITHLATPSPVSHSL SSNSSPSATTH 966 

Qy 519 dnlTKVTEEWANMP-EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLY 577 

| :::::: I | | : I I I : : I I 
Db 967 SNMSSI— QSTTSVPGRRFTVQPVSQA-ESGI SSSISTPHPEPT 1007 

Q y 578 PAAQLCPS FEES EAT PSPVLPDIVMEAPLNSAV PSAGASVIQPSSS PLEAS SVNY 632 

|| II II :| :| I I I Ml : I M: 

Db 1008 PAITSCP P P VP S VP P WSN GT LNL E VAP KQT P S ATNQNVDT QH S S S T AS TAT L 1060 



Qy 



633 ESIKHEPEN PPPYEEAMSVS LKVSGIKEEIKEPENINAA 671 

: I I I : I : : 1111= I : = 

Db 1061 — VSETPATVHVTPISVPAPVQEPLVIDHHSDVLTQLDSELRKVSGVSHS-ASPSTWES 1117 



Qy 672 LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 731 

| I : I I : : I I : : = 
Db 1118 LT SMTPQT I PLACQTV- PAS I GQ AP AVI AAAHA AS 1151 

Qy 732 LFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLS 7 91 

| : |:| I : I II : Mill l : 
Db 1152 LIPNASVPQSPSRLD AETGLAGL HEKLEAL KME 1184 

Qy 792 LDNTKDTLLPDE VSTLSKKEKIPLQ MEELSTAVYSNDDLFI SKEAQIRET 841 

| :| : I: : I I : : I I : I I = — I" 
Db H85 QDRRED — MGDDAI GTTTTDGKDEI PIDTLKGLAEALGKVIHADG RET TP 1232 



Qy 



842 ETFSDSSPIEII -DEFPTLI SSKTDSFSKLAREYTDLEVS HKSEIANAP 889 

:|:| ::| I MM II III I : : I 

Db 1233 MP PDHPDLTDASTQQLI S P SNPDVLTTMS SAVEGSAS STMI EDI DASTSAVDASMMNSMP 1292 



0v 890 DGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE 94 9 

| | : | :: : : I I I M : : I I 

1293 PGA QNSTDQIPAAMTLSMDQECAQSMTSSITR NTTGTKLAT 1333 



Db 



Qy 950 IESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSAD 985 

|:: III I Ml I I 

Db 1334 FENL ETALS STLGTHI RQPNAP S SRD 1359 



RESULT 15 
US-08-978-277A-4 

; Sequenced, Application US/08978277A 

; Patent No. 6582956 

; GENERAL INFORMATION: 

; APPLICANT: Gelman, Irwin H. 

TITLE OF INVENTION: TUMOR SUPPRESSOR GENE 
NUMBER OF SEQUENCES: 2 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Brumbaugh, Graves, Donohue & Raymond 
STREET: 30 Rockefeller Plaza 
; CITY: New York 

STATE: NY 
COUNTRY: USA 
; ZIP: 10112-0228 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
; COMPUTER: IBM Compatible 



OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/97 8 , 277A 
FILING DATE: 
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/665,401 
FILING DATE: 18-JUN-1996 
ATTORNEY /AGENT INFORMATION: 
NAME: Clark, Richard S 
REGISTRATION NUMBER: 26,154 

REFERENCE/DOCKET NUMBER: A30558 - 165/34008 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-4 08-2558 
TELEFAX: 212-7 65-2519 
TELEX: 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1596 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
HYPOTHETICAL : NO 
ANTI-SENSE: NO 
FRAGMENT TYPE: internal 
ORIGINAL SOURCE: 
US-08-978-277A-4 

Query Match 4.5%; Score 265; DB 4; Length 1596; 

Best Local Similarity 21.3%; Pred. No. l.le-07; 

Matches 232; Conservative 145; Mismatches 422; Indels 288; Gaps 47; 

2 EDLD — QS PLVS S SDS PPRPQPAFKYQFVREPEDE EEEEEEEEEDEDEDLEELEV 54 

|:|: :| : Ihll I I : I I II:: M : : I 

Db 6 20 EELEKVKSATLSSTDST VS EMQDEVKT VGEEQKP EEPKRRVDT SVSWEA 668 



Qy 



55 LERKPAAGLSAAPVPTAPAAGAPL MDFGNDFVPPAPR 91 

| :: | I I : | | || : : 

Db 669 LICVGSSKKRARKASSSDDEGGPRTLGGDSHRAEEASKDKEAGTDAVPASTQEQDQAQGS 728 

q 92 -GPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPAS 150 

| M | I :|: I :| I III I I : 

729 SSPEPAGSPSEGEGVSTWE SFKRLVTPRKK SKSKLEEKAEDSS VEQLSTE 778 



Db 



Ov 151 VSPQAEPWTPPAPAPAAPPSTPAAPKRRGSSGAVV^XXXKIMDLKEQPGNTISAG 206 

: | | | : I 1:1 II" ; l 

779 IEPSREESWV SIKKFIPGRRKKRA DGKQEQATVEDSGPVEI 819 



Db 



Qy 207 QEDFPSVLLETAASXPSLSPLSAASFKEHEYL GN LSTVLPTEGTLQENVSE 257 

|| |:| |:: III : I I : II I I :l I 
Db 82 0 NEDDPNV PAWPLSEYNAVEREKMEAQGNTELPQLLGAVYVSE E 8 6,3 

Qy 258 ASKEVSEKAKTLLID— RDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKD 315 

||: : I I I : I I I : : I : I : I I M : ' : 

Db 864 LSKTLVliTVSVAVIDGTRAWSVEERSPSWISASW-EPLEHTAGEAMPPVEEVTEKDII 922 



Qv 316 EEEKLVSNNILHXQQELPTALTKLVKE DEWSSEKAKDSFNEKRVAVEAPMREEY 37 0 

|| | II:: I I::|:ll ■ I II I 

Db 923 AEETPV LTQTLPEGKDAHDDMVTSEVDFTS EAVTATET 960 

QV 371 ADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDD 430 

III ::| :ll::l =: = 1= I: -Ml 

961 S EALRT EEVT EAS GAEET T DMVS AVS QLT D S PDTTEEATPVQEVESGVLD 1010 

431 xs __ FPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDEK 481 

| : : : | : I II::: M 
1011 T E E E E RQT Q AT LQAVADKVK EES QVPATQTVQRTGSKALE KVE 1053 



Db 

Qy 

Db 



Qy 4 82 KIEEKKAQIVTEKNTSTKTSNPFFVA AQDSETDYVTTDNLTKVTEEWANMPEGL 53 6 

:: | | : : I I II I I I I I I I I I 

1054 EVEEDSEVLASEKEKDVMPKGPVQEAGAEHLAQGSETGQAT PESL 1098 



Db 



Qv 537 TPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPV 596 

|: I I I : : ::: s=|::::.l : : :ll I 

Db !Q99 EVPEVT-ADVDHVATCQVIKLQQLMEQAVAPESS — ETLTDSETNGSTP 1144 

Qv 597 L PD 1 VMEAPLNSAVPSAGASVT QP S S S PLEASSVN YES I KHEPENPP P YEEAMS 650 

|| : ::| I hi I : I I : : I I I I I 
Db 1145 LADSDTADGTQQDETIDSQDSKATAAVRQSQVTEEEAATAQKEEPSTLPNNVPAQE 12 00 

Qy 651 VSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

| :|| |: |: : I :: :|:: I |: I I I I 

Db !201 EHGEEPG — RDVLEPTQQELTAAAVPVLAKTEVGQEGEVDWLD-GEKVKEE 1248 

Qv 711 QPVPDHSELVEDSSPDSE-PVDLFSDDSIPDVPQKQD-ETVMLVKESLTETSFESMIEYE 768 

Ml | |:| : |: I : I h |: : II I h :l I 

Db 1249 QEVFVH SGPNSQKAADVTYDSEVMGVAGCQEKESTEVQSLSLEEGEMETDVEKE 1302 

Qv 769 NKE-KLSALPPEG GKPYLESFKLSLDNTK DTLLPDEVSTL 807 

:| | : || III I 1:1 IH : 
Db 1303 KRETKPEQVSEEGEQETAAPEHEGTYGKPVLTLDMPSSERGKALGSLGGSPSLPDQ 1358 

Qy 808 SKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDS S PI EI IDE FPTLISSK 863 

| | :|:: I I I : :: II I:: II I : IN 

Db 1359 DKAG C I EVQVQ S L DT T VT QT AEAV EKVIETWISETGESP-ECVGAHLLPAEKSSA 1413 

Qy 864 TDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFS— 920 

| | : : : | : I I : I I I : I : : : I I I 

Db 1414 TGGHWTLQHAEDTVPLGPESQ AESIPIIVTPAPES TLHPDLQGEISASQRE 1464 

Qy 921 DDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLP 968 

I I : I I I I I : : I : I I : : : I : 

Db 1465 RSEEEDKPDAGPDADGKESTAIEKVLKAEPEILELESKSNKIVLNVIQTAVDQFARTETA 1524 

Qy 969 SDTEKED 975 

: I I 

Db 1525 PETHAYD 1531 



Search completed: September 3, 2004, 16:10:38 
Job time : 27.2562 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 3, 2004, 16:00:14 ; Search time 25.1547 Seconds 

(without alignments) 
4504.667 Million cell updates/sec 

Title: US- 09-830-972-2 9 

Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 283366 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



PIR_78:^ 
pirl : 
pir2 : 
pir3 : 
pir4 : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


787.5 


13. 


3 


2 


688 


11. 


6 


3 


671 


11. 


3 


4 


528 


8 


9 


5 


519.5 


8 


8 


6 


340.5 


5 


7 


7 


331 


5 


6 


8 


325.5 


5 


5 


9 


324.5 


5 


.5 


10 


320.5 


5 


.4 


11 


299.5 


5 


. 1 


12 


299 


5 


. 0 


13 


290.5 


4 


.9 



776 


2 


A46583 


208 


2 


160904 


267 


2 


A60021 


2484 


2 


T26216 


2607 


2 


T26215 


222 


2 


T26213 


7962 


2 


138346 


5327 


2 


T13564 


865 


2 


A47282 


873 


2 


A47283 


3488 


2 


T34418 


3924 


2 


S37431 


1274 


2 


T16251 



neuroendocrine- spe 
neuroendocrine- spe 
tropomyosin- relate 
hypothetical prote 
hypothetical prote 
hypothetical prote 
elastic titin - hu 
microtubule-associ 
calcium-binding pr 
calphotin - fruit 
hypothetical prote 
ankyrin 2, neurona 
hypothetical prote 



14 


289 


4. 


9 


2364 


2 


A56577 


microtubule-associ 


15 


289 


4. 


9 


2464 


1 


QRMSP1 


microtubule-associ 


16 


287.5 


4. 


9 


971 


2 


T19431 


hypothetical prote 


17 


285.5 


4. 


8 


1621 


2 


A82255 


hypothetical prote 


18 


278 


4 . 


7 


1948 


2 


S00485 


gene 11-1 protein 


19 


274 


4. 


6 


3507 


2 


T34513 


hypothetical prote 


20 


272.5 


4. 


6 


1829 


2 


T24583 


hypothetical prote 


21 


272.5 


4. 


6 


2361 


2 


T25752 


hypothetical prote 


22 


271 


4 . 


6 


2187 


2 


T30826 


nascent polypeptid 


23 


270 


4. 


6 


1230 


2 


T22458 


hypothetical prote 


24 


267.5 


4 . 


5 


1851 


2 


T19964 


hypothetical prote 


25 


263. 5 


4 . 


4 


2409 


1 


A60979 


versican precursor 


26 


262.5 


4. 


4 


6642 


2 


T29757 


protein UNC-89 - C 


27 


261 


4 . 


4 


1558 


2 


B71603 


RESA-H3 antigen PF 


28 


260.5 


4 . 


4 


3381 


2 


T42389 


versican precursor 


29 


259. 5 


4. 


4 


1828 


2 


A40115 


microtubule-associ 


30 


258 


4 . 


4 


1616 


2 


G64242 


cytadherence-acces 


31 


257.5 


4. 


3 


5170 


2 


T15348 


hypothetical prote 


32 


256 


4. 


3 


990 


2 


151618 


nucleolar phosphop 


33 


256 


4. 


3 


1634 


2 


T26517 


hypothetical prote 


34 


255. 5 


4 . 


3 


1684 


2 


JW0057 


gravin - human 


35 


253 


4. 


3 


1189 


2 


S56852 


hypothetical prote 


36 


252. 5 


4. 


3 


1224 


2 


T14007 


microtubule-associ 


37 


252 


4. 


,3 


4377 


2 


A55575 


ankyrin 3, long sp 


38 


250.5 


4, 


,2 


1824 


1 


QRHUMT 


microtubule-associ 


39 


250.5 


4. 


.2 


1830 


2 


A37981 


microtubule-associ 


40 


248 


4. 


.2 


1825 


2 


S13507 


microtubule-associ 


41 


248 


4 . 


.2 


1890 


2 


T04556 


hypothetical prote 


42 


248 


4. 


.2 


3421 


1 


WZBEB6 


367K tegument prot 


43 


246 


4, 


.2 


1110 


2 


151116 


NF-180 - sea lampr 


44 


244.5 


4 , 


. 1 


1029 


2 


T30351 


mucin-like protein 


45 


243 


4, 


.1 


1320 


2 


JC5630 


TCOF1 protein - mo 



ALIGNMENTS 



RESULT 1 
A46583 

neuroendocrine-specific protein, splice form A - human 

N; Contains: neuroendocrine-specific protein, splice form B 

C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: A46583; 160903 

R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaeke 
F.C.; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A;Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specific gene and identification of its 135-kDa translational 
product . 

A;Reference number: A46583; MUID: 93293865; PMID:7685762 
A;Accession: A46583 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: mRNA 
A; Residues: 1-776 <R0E1> 

A;Cross-references: GB:L10333; NID:g307306; PIDN : AAA59950 . 1 ; PID:g307307 
A;Accession: 160903 



A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 421-776 <ROE2> 

A; Cross-references: GB:L10334; NID:g307308; PIDN : AAA59951 . 1 ; PID:g307309 

C; Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A; Cross-references : GDB:203968; OMIM: 600865 
A; Map position: 14q21-14q22 

Query Match 13.3%; Score 787.5; DB 2; Length 776; 

Best Local Similarity 31.9%; Pred. No. 3.1e-25; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 58 8 ESEATPSPVLPDI — VMEAPLNSA VPSAGASVIQPSSSPLE ASSWY 632 

I MINI:: II: I = I : : I I : I : I 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

Qy 633 — ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

| : M : : : I I : : | : | I : : I I I : : : I 

Db 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDISIKPEGVREPDK-PAPVEGKIIKDHLLEEST 259 

67 6 EAPYISIACDL IKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

:| I M— :: |: :: 

Db 260 FAPYID DLSEEQRRAPQITTPVKITLTEIEPSVE TTTQEKTPEKQDICLK 309 

Qy 723 SSPDSEPVDLFS DDS IPDVPQKQ DETVMLVKESLTETSF 7 61 

Ml: I I III I : I || : :||: 
Db 310 PSPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 

Qy 762 ESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELS 821 

: I I I : :| :: : 

Db 364 -KGLSYETAENPRPVGQLADRP EVKARSGPPTIPSPLDHEA 403 

Q y 822 TAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFP TLISSKTDSFS 868 

:: | | I I I:: I I : I II 

Db 404 SSAESGD- S EI ELVS EDPMAAEDALP S GYVS FGHVGGP P P S P 444 

Qv 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD 921 

: || : |: : I : : I : I I I I I I 

Db 445 ASPSIQYSILREEREAELDSELIIESCDASSAS EESPKREQDSPPMKPSALD 496 



Qy 



Qy 



Qy 

Db 



922 DF SKNGSATSKVLL LPPDVSALGHTQAEIESIVKP 95 6 

| : | | I III II I I : : I 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

957 KVLEKEAEKKLPSDTEKEDRSPSA 1 FSADLGKT S WDLLYWRDI K 1001 

: | : :: | | : | : I II : I I I I I I I I I 
552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIK 599 

Qy 1002 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1061 

:||:MI: I II Ml II : I I I I I : I I I II IIIIIIM I : I I : I I = I I I I I I : I 
Db 600 QTGIVFGSFLLLLFSLTQFSWSWAYIAL^ALSATISFRIYKSVLQAVQKTDEGHPFKA 659 

Qy 1062 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1121 

Ml |: :|:| :\\\:: :|| I = I I I t I I I t I I I I I I M I I I I I I : I I I I I I I 

Db 660 YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 



Qy 1122 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

|||MI::|::|:|::ll:i : I I I I I I INI :||lllllll II M 

Db 720 NGLTLLLMAWSMFTLPWYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 77 6 



RESULT 2 
160904 

neuroendocrine-specif ic protein C - human 
C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 ffsequence_revision 24-May-1996 #text_change 05-Nov-1999 
C; Accession: 160904 

R;Roebroek, A. J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers, 
F.C.; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A; Reference number: A46583; MUID: 93293865; PMID: 7685762 
A; Accession: 160904 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-208 <RES> 

A;Cross-references: GB:L10335; NID:g307310; PIDN: AAA59952 . 1; PID:g307311 

C; Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A;Cross-references: GDB:203968; OMIM: 600865 
A; Map position: 14q21-14q22 

Query Match 11.6%; Score 688; DB 2; Length 208; 

Best Local Similarity 67.5%; Pred. No. 6.2e-22; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 

KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

| : : | || I I I I I I : II : I I I : I II II I I I : I I I I I : I I I I I I I I I I I I I I : 
KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSAT1SFRIYKSVL 77 

QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I : I I : I I I I II : I I I I I: :| = l :lll:: :M hlllllll Mill 

QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137 

AVLMWFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM7VKIQ 1167 

Mill: I I I I I I I I I I I I i : : I : : I : I : : I I : I : I I I I I I I I I I : : : I M I 
AVIjMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQ 197 

AKI PGLKRKAE 1178 
I I I I I II M 



RESULT 3 
A60021 

tropomyosin-related protein, neuronal - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999 

C;Accession: A60021 

R;Wieczorek, D.F.; Hughes, S.R. 

Brain Res. Mol . Brain Res. 10, 33-41, 1991 



Qy 


988 


Db 


18 


Qy 


1048 


Db 


78 


Qy 


1108 


Db 


138 


Qy 


1168 


Db 


198 



A;Title: Developmentally regulated cDNA expressed exclusively in neural tissue. 
A;Reference number: A60021; MUID: 91278684 ; PMID:1647480 
A; Accession : A60 021 
A;Molecule type: mRNA 
A; Residues : 1-267 <WIE> 

A;Cross-references: EMBL:X52817; NID:g456549; PIDN : CAA37001 . 1 ; PID:g456550 
C; Comment: This neuronal-specif ic mRNA was identified by hybridization to an 
alpha-tropomyosin probe but does not show homology in amino acid sequence. 

Query Match 11.3%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 4.4e-21; 

Matches 124; Conservative 34; Mismatches 29; Indels 0; Gaps 0; 

Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIAIJ^LSVTISFRIYKGVI 1047 

I : : I I I I I I I I I : I I : I I I : I II I I I I I : I I I I I : I I I II MINIM I : 
Db 9 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 68 

Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I : || : I I I I I I : I I I I I : : I : I : I I I : : Ml I M I I I I II II I II M I I I 

Db 69 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 12 8 

Qy 1108 AVLMWFTWGALFNGLTLL1IALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

Mill: I I I I II I I I I I I I : M :: I M : M I M M I I M I MM : : Mill 
Db 12 9 AVLMWLLTWGALFNGLTLLLMAVVSMFTLPVVYVKHQAQV^ 18 8 

Qy 1168 AKIPGLK 1174 

II II I : 

Db 189 AKIPGAR 195 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26216 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A;Accession: T26216 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-2484 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB514 67 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3c 

A; Experimental source: clone W06A7 

C; Genetics: 

A; Gene : CESP : W06A7 . 3c 

A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 8.9%; Score 528; DB 2; Length 2484; 

Best Local Similarity 20.9%; Pred. No. 7.4e-14; 

Matches 277; Conservative 186; Mismatches 461; Indels 402; Gaps 54; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEE EE 38 

: I :: I MM I I M I : M 

Db 1397 LEKVEVQP — DLSQNSPAPHKIIDLHF-NI PKDHEDYGNDYVPFGTESSEESQKADGNQE 1453 



Qy 



39 EEEEED EDEDLEELEVLERKPA AGLSAAP 67 

: I I I I I M : I : I : I : I : 

Db 1454 NQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDVDASDVNEQDEESTLKIL 1513 



Qy 



68 — VPT APAAGAP LMD FGND FVP PAP RGP L P AAP P VAP - - E RQ P S W 108 

I I : I : : II I I I I : I I I I 

Db 1514 KWP SEPSLLE--LDFTND - - PKVI HVP I P LME PATMYLEEMVEWI I ADAVKEVS EMEW 1569 



Qy 



109 DPSPVSSTVPAPSPLSAAAVSPSKLP-EDDEPPARPPPPPPASVSPQAEPVW 159 

| I I I I I I : I I I I I I I II I I : I : 

Db 1570 TESEISEMAPQVSESTCPIPEPL ADLKLPVEDDEKTPEPEPWPGQVQERIIPIE 1624 



Qy 160 TPPAPA-PAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVL— LE 216 
I I I I I I : : I i I : 

1625 VEQAPTIPQRPPRAP KSELPKVAKPLD 1651 



Db 

Qy 



217 TAASXPSLSPLSA ASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLL 27 0 

: | :||: : :: | : :| Mil: : = I = I 

Db 1652 DSKSRVRFAPLNIKLGRTYSEEQQKELVESLERPL-TIITQQKPPEKPTEDIG— ALSPL 1708 



Qy 271 IDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQ 330 

| |: |: :|| : :|:| Ml : :| I 
Db 1709 S PNTLAEYEEVPMMDMQS VPHSPQE KQEEIEALSEII 1745 

Qy 331 ELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK PFERVWEVKDSKE 387 

I |: :: I Ml ' \ : : I : I M I : M I : 

Db 1746 EEPQAMKEVEKPVE-SAPEKDNESLEAPEI-INEPIRRVLVETKIMGPGKSLNEDNDDDD 1803 

Qy 388 D-SDMIAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESS NDDTSFPSTPEGIKD 442 

|| M M : I : : I I = = : : M II I M M : 

Db 1804 DGSECLDSIGDLS ERTI QR — FNT S I DDP S I RRDS FS S I S S FGDRQKFRTAI EN I RQ 1858 

Qv 443 RSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSN 502 

|| :::::: I : :ll 
Db 1859 DLLPFQSSVSQYLRSSPNP SQQLLVTN 18 85 

Qy 503 PFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETK 562 

M I : I M III M : : M :: I : : I 

Db 1886 LSMDSPSD — LSPNAPPVGFENTAQFLEKLQQE-DRPSAEGSIDSSGFEKVDHE 1936 

Q V 563 MDLVQT S EVMQES L YPAAQLC P S FEE S EAT P S PVLPDI VMEAPLN S AVP SAGAS VI QP S S 622 

:| I I I : i I I: Ml 

1937 GLDEFAAPP VHDPMQKSVFGSLGSDDMKPGS 1967 



Db 

Qy 



623 SP LEAS S W YES IKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENIN AAL 672 

: | : | | : M : I :M I Ml 

Db 1968 QDDGFVFIERNEAN EATLKKNQKMSSHHNDVIEKNYFNDNAPTAAL 2013 



Qy 673 QETEAPYISIACDLIKETKLSAEPAPDFSDYSEMA -KVEQPVPDHSEL 719 

|:| I |::: II Ml M MM I 

Db 2014 - - LE S P I AE EARKLVQ DAVE S A SEYKKQAVDS GDEI GRELLDNVEQKI EQVKEP 2065 

Qy 720 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

: | | : I I II I M :: I I : I I M 
Db 2066 I VDSL — HKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPVPEKIE TPE 2113 



Qy 780 GGKPYLESFKLSLDNTKDTL--LPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQ 837 

I : II: : III ::l I I I hi = 

Db 2114 PLVDIHDTVDKVHDEVDNFLRREPTP-PFETDDVAPLSDDKPQFGNQTP 2161 

Qy 838 IRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPC 897 

:|| I : I : I ::: : :: I I : III: 

Db 2162 EEDETTFDRKGPLTIPEEVEKAAAAQNNDLD D FD P LVT S NT GAAF GAAVGAAAA- - 2215 

Qy 8 98 TELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ7VEIESIVKPK 957 

: : I I : : II : I : : : I 

Db 2216 VESLTEEEM FGH — QKFETVPRPP 2237 

Qy 958 VLEKEAEKKLPSDTEKEDRSPSAIFSADLG KT — 98 9 

I I I I I I :: M II 
Db 2238 T PPKDISDEDVKPSTV NLGPSHHHSHPSSPHHSILKHHGDAWIDFKTVP 2286 

Qy 990 -SWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 1048 

I : I : : I I I I I I : : I : I : I I : : : : I I I I I I : : I I 

Db 2287 PCVLDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTVVTYSLLL^ 2346 

Qy 1049 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFA 1108 

|:|:| Ml | :: : :| | :: : I I :|::| |: ::|:M 
Db 2347 QIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFG 2406 

Qy 1109 VLMWVFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGIANKNVK 1168 

: : : I I I : : I : I I I II I : : I I I I : I I : I 11=1 : : : I : I 
Db 2407 LVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDE 2466 

Qy 1169 KIPGLK 1174 

I : I I : 

Db 2 4 67 KLPFLR 2472 



RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C ; Species : Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26215 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A;Reference number: Z20173 
A; Access ion: T26215 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-2607 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN : CAB01522 . 2 ; GSPDB : GN00023 ; CESP : W06A7 . 3a 

A; Experimental source: clone W0 6A7 

C; Genetics: 

A; Gene: CESP : W06A7 . 3a 

A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 



Query Match 8.8%; Score 519.5; DB 2; 

Best Local Similarity 20.4%; Pred. No. 1.8e-13; 
Matches 292; Conservative 175; Mismatches 474; 



Length 2607; 
Indels 491; Gaps 



55; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEE EE 38 

:| :: I I : I I " I 1:1 I: :l 

Db 1397 LEKVEVQP--DLSQNSPAPHKIIDLHF-NIPKDHEDYGNDYVPFGTESSEESQKADGNQE 1453 

Qy 39 EEEEED EDEDLEELEVLERKPA AGLSAAP 67 

: | I I I M I : I : I : I : I : 

Db 1454 NQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDVDASDVNEQDEESTLKIL 1513 

Qy 68 — VP T APAAGAP LMD FGN D FVP PAP RG P L P AAP P VAP — ERQPSW 108 

I I : I : : I I I I I hi I I I 

Db 1514 KWPSEPSLLE — LDFTND--PKVIHVPI PLMEPATMYLEEMVEWIIADAVKEVSEMEW 1569 

Qy 109 DPSPVSSTVPAPSPLSAAAVSPSKLP-EDDEPPARPPPPPPASVSPQAEPVW 159 

I II I I I I : I I I I M I | | | | : I : 

Db 1570 TESEISEMAPQVSESTCPIPEPL ADLKLPVEDDEKTPEPEPWPGQVQERI I PI E 1624 

Qy 160 TPPAPA-PAAPPSTP AAPKRRGS S GAWXXXXKIMDLKEQPGNTI SAGQE 208 

I I I I I I II I : I : I I I I : 

Db 1625 VEQAPTIPQRPPRAPKSELPKVAKPLDDSKS RVRFAPLNIKLGRTYSEEQQKEL 1678 

Qy 2 09 DFPSVLLETAASXP SLSPLSAASFKEHEY — LGNLSTVLPTEGTLQEN 254 

: I : : I I : I I I I I : I : I : : : : I : II 

Db 1679 VESLERPLTII-TQQKPPEKPTEDIGALSPLSPNTLAEYEEVPMMDMQSVPHSPQEKQEE 1737 

Qy 255 VSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK-- 312 

: | | I : I : : : I : I : : I : I : I : : I : 

Db 1738 I-EALSEIIEEPQAM KEVEKPVESAPEKDNESLEAPEI INEPIRRVLVETK 1787 

Qy 313 NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEA 364 

|:| :: : I :| II: II ::: 
Db 1788 IMGPGKSLNEDNDDDDDGSECLDSIGDL SERTIQRFN TSIDD 1829 

Qy 365 P — MREEYADFKPFERVWEVKDSKED--SDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

| | : :: | : : : I : I : I : I I : : I I I 

Db 1830 PSIRRDSFSSISSFGDRQKFRTAI ENIRQDLLPFQSSVSQYLRSSPNP SQQLLVTNL 1886 

Qy 421 EKDSESSNDDTSFPSTPEGI KDRSGA — YITCAPFNPAATESIATNI F 466 

III I I : I I : M I 1:1 I : 

D b 1887 SMDSPS DLSPNAPPVGFENTAQFLEKLQQEDRPSAEGSIDSSGFEKVDHEGLDEFAA 1943 

Qy 467 PLLEDPTSEN XTDEKK IEEKKAQIVTEKNTSTKTSNPFFVAAQDS 511 

I : I I : : : I : I I I : I II : I : I : : 

Db 1944 PPVHDPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATLKKNQKMSSHHNDVIEKNY 2003 

Qy 512 ETDYVTTDNLTKVTEEWANMPEGLTPDLVQEAGESELNEV-TGTKIAYE TKMDLV 566 

I | : I : I | | | : | | : I : I : I I : : 

Db 2004 FNDNAPT AALLESPIAEEARKLVQDAVESASEYKKQAVDSGDEIGRELLDNVEQKIE 2 060 

Qy 567 QTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVP SAGAS VIQPSSSPLE 626 

I I : : II : I I I I : I I I I I 
Db 2061 QVKEP I VDS LHKAYDGVGDFVH- ETVPNAV-DDFVREAE 2097 

Qy 627 AS SVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDL 686 

111:11 I : I : I I I : I : 

Db 2098 KQLPESPVP EKIETPE PLVDIHDTV 2122 



Qy 687 IK ETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDV 741 

I : I I I I i : I : I i : I : : 
Db 2123 DKVHDEVDNFLRREPTPPFE TDDVAPLSDDKPQFGNQT 2160 

Qy 742 PQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLP 801 

: : I II I I ! : I 

Db 2161 -PEEDETTFDRKGPLT- — IP 2177 

Qy 802 DEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLIS 861 

: I I | : | : | | : I : I I :: 

Db 217 8 EEV EKAAAAQNND LDDFDPLVT 2199 

Qy 862 SKT DSFSKLAR EYTDLEVSHKSEIANAPD 890 

|| I : I : : | : | | : | 

Db 2200 SNTGAAFGAAVGAAAAVESLTEEEMFGHQKFETVPRPPTPPKDISDEDVK-PSTVNLGPS 2258 

Qy 891 GAGSLPCTELPHD LSLKNIQPKVEEKI S FSDDFS 924 

11:11 : | : | : | I : I I 

Db 2259 HHHSHPSS — PHHS I LKHHGDAWI DFKTVPPCAQNAFS PGEIMFLLAFFVYLSCFAS FFS 2316 

Qy 925 KNGSATSKVLLL PPDVSALG 944 

I : : I I : I I I 

Db 2317 KSLPLLDNLLSLWYLSISLIIHVKHHRKFRWNEEQATTMSKLGAVGRGLYALIAFIVNI 2376 

Qy 945 — HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

: : I I III I I I I I : I :: I I II I I 

Db 2377 VL RVGLNVALVVGVAVS AH EAYKLT K S SGVLRKKEVLDVIYWRDAKK 2423 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

: : | : | : | I : : : : I I I I I i : : I i I : I : I I I I 

Db 2424 SAIVLSIALLVLFVLAKYPLLTWTYSLLI^ 2483 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

| : : : : | | : : : I I : I : : I I : : : I : I I : : : I I I : : I : 
Db 2484 LAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFGLVLWSLTYIASWFS 2543 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 1174 

I I I I I I : : I I I i : II : I I I : I : : : I : I I : I I : 

Db 2544 GFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDEKLPFLR 2595 



RESULT 6 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26213 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A;Accession: T26213 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-222 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB01523 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3b 
A; Experimental source: clone W06A7 
C; Genetics: 



A; Gene: CESP : W06A7 . 3b 
A;Map position: 5 
A;Introns: 27/1; 77/2; 201/2 



Query Match 5.7%; Score 340.5; DB 2; Length 222; 

Best Local Similarity 32.4%; Pred. No. 1.4e-07; 

Matches 66; Conservative 52; Mismatches 83; Indels 3; Gaps 2; 

Qy 972 EKEDRSPSAIFSADL-GKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 1030 

: I : | : : : I II : : I : : I I I I I I : : I : I : I I : : : : I I 
Db 9 KKYSKQPTWVPATDFPGK — ILDVI YWRDAKKSAIVLSLALLVLFVLAKYPLLTWTYSL 66 

Qy 1031 LALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIK 1090 

I I I I :: I I I : I : I III i :: : : I I :: : I I 

Db 67 LLALGAAAGFRVFKKVEAQIKKTDSEHPFSEIIAQDLTLPQEKVHAQADVFVEHATCIAN 126 

Qy 1091 ELRRLFLVDDLVDSLKFAVLMWFTYVG7^LFNGLTLLILALISLFSVPVIYERHQAQIDH 1150 

: | : : I | : : : I : I I : : : I I I : : I : I I I I I I : : I I I i : I I : i II 
Db 127 KLKKLVFVES PLES I KFGLVLWSLT YI ASWFS GFTLAI LGLLGVFS VPKVYESNQEAI DP 186 

Qy 1151 YLGLANKNVKDAMAKIQAKIPGLK 1174 

: t : : : I : I I : I I : 

Db 187 HLATISGHLKNVQNIIDEKLPFLR 210 



RESULT 7 
138346 

elastic titin - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text__change 21-Jul-2000 
C;Accession: 138346 
R;Labeit, S.; Kolmerer, B. 
Science 270, 293-296, 1995 

A; Title: Titins : giant proteins in charge of muscle ultrastructure and 
elasticity. 

A; Reference number: A57430; MUID : 96026330 ; PMID: 7569978 
A;Accession: 138346 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-7962 <RES> 

A;Cross-references: EMBL:X90569; NID : gl017426; PIDN : CAA62189 . 1; PID:gl017427 

C; Genetics: 

A; Gene: GDB : TTN 

A;Cross-references : GDB: 127867; OMIM: 188840 
A;Map position: 2q31-2q31 

Query Match 5.6%; Score 331; DB 2; Length 7962; 

Best Local Similarity 20.0%; Pred. No. 4.7e-05; 

Matches 235; Conservative 144; Mismatches 404; Indels 392; Gaps 48; 

Qy 32 EDEEEEEEEEEEDEDEDLEELE — VLERK PAAGLSAAPV PTAPA 73 

I : I I I I I I I I I : I I I I : I I : : I I I I I 

Db 6176 EEEREEEEEAEWEYEVMEEPEEYVVEEKLHIISKRV^EAEPAEVTERQEKKIVLKPKIPA 6235 

Qy 74 AGAPLMDFGNDFVPPAPRGPLPAAP-PVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSK 132 

: | : | | I : I I : : I I I I I I I I 

Db 6236 K IEEPPPAKVPEAPKKIVPEKK VPAPVP-KKEKVPPPK 6272 



Qy 133 LPEDDEPPA RPPP PPPASVS PQAEPVWTPPAPAPAAPPS-TPAA 175 

: I I : : I : 1 I I I I I : III I I : I 

Db 6273 VPEEPKKPVPEKKVPPKVIKMEEPLPAKVTEKHMQITQEEKVLVAVTKKEAPPKARVPEE 6332 

Qy 176 PKRRGSSGAWXXXXKIMDLK EQPGNTI SAGQEDF PSVLLETAASXPS-LSPL 227 

Ml I : : I I I : I : : : : I : I I : : 

Db 6333 PKR AVPEEKVLKLKPKREEEPPAKVTEFRKRWKEEKVSIEAPKREPQPIKEV 6385 

Qy 228 SAASFKEHEYLGNLSTVLPTEGTLQEN-VSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

: I | I Ihlll : I I : : : I I I I I : I 

Db 6386 TIMEEKERAY TLEEEAVSVQREEEYEEYE EYDYKEFEEYEPTEE 6429 

Qy 287 GSSFS VSPKAESAVIV — ANPREEI 309 

: | | | | : : | | | : : 

Db 6430 YDQYEEYEEREYERYEEHEEYITEPEKPIPVKPVPEEPVPTKPKAPPAKVLKKAVPEEKV 6489 

Qy 310 IV KNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVVS SEKAKDS FNEKRVAVEAP 365 

| || I : :::::: | | : | | | : II III I 

Db 6490 PVPIPKKLKPPPPKVPEEPKKVFEEKIHISITK — REKEQVTEPAAKVPMKPKRWAEEK 6547 

Qy 366 MREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSE 425 

: : I II II I :: | : : |: : : I : I I : : I 

Db 6548 VPVPRKEVAPPVRVPEVPKELEPEEV AFEEEWTHVE-EYLVEEEEEYIHEEE-E 6600 

Qy 426 SSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDEKKIEE 485 

: : I I : : III : II" I I I I 

Db 6601 FITEEEWPVIPVKVPE VPRKPVPEEKKPVPVPKKKEAPPAKVPEVPKKPEE 6652 

Qy 486 KKAQIVTEKNTSTKTSNPFF VAAQDSETDYVTTDNL 521 

I :: :| I | | :: I : 

Db 6653 KVPVLIPKKEKPPPAKVPEVPKKPVPEEKVPVPVPKKVEAPPAKVPEVPKKPVPEKKVPV 6712 

Qy 522 TKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEV 571 

Ml :M | | : I: :| : : I I II 

Db 6713 PAPKKVEAPPAKVPEVPKKLIPEEKKPTPVPKKVEAPPPKVPKKREPVPVPVALPQEEEV 6772 

Qy 572 M-QESLYPAAQLCPSFEESEATP 593 

: : I : I :: I III I 
Db 6773 LFEEEIVPEEEVLP — EEEEVLPEEEEVLPEEEEVLPEEEEIPPEEEEVPPEEEYVPEEE 6830 

Qy 594 S P VL PDIVMEAP LN S AVP SAGAS VI Q PSSSPLEASSVNYESI- 635 

I I I : : : I : : I I I : I : I I : I 

Db 6831 EFVPEEEVLPEVKPKVPVPAPVPEIKKK\^EKKWIPKKEEAPP7^PEVPKKVEEKRII 6890 

Qy 636 — KHE— PENPPPYEEAMSVSLKVSGIKEEI KEPENIN 669 

II I I I I I I | | : : I I I 

Db 6891 LPKEEEVLPVEVTEEPEEEPISEEEIPEEPPSIEEV EEVAPPRVPEVIK 6939 

Qy 67 0 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 72 9 

1:1 I 1:11 | | | : : | : I : : I 

Db 6940 KAVPEAPTPV PKKVEAPP AKVSKKI PEEKVPVPVQKKEAPP 6980 

Qy 730 VDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFK 789 

:|:||:| I :M I |:|l 1= 
Db 6981 A KVPEVPKKVPEKKVLV PKKEAVPPAKGR 7009 



Qy 790 LSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSP 849 

| : | :: I I :: I : : | | :: I : I : II:: 

Db 7 010 TVLEEKVSVAFRQEVWKERLELEWEAEVEE — I PEEEEFHEVEEYFEEGE 7 059 

Qy 850 IEIIDEFPTLISSKTDSFSKLAREYTDLEV— SHKSEIANAPDGAGSLPCTELPHDLSLK 907 

: : I I I : : : : : : : | | : : I : I I : : I I 

Db 7060 FHEVEEFIKLEQHRVEEEHRVEKVHRVI EVFEAEEVEVFEKPKAPPKGP EISEK 7113 

Qy 908 NIQPK VEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

III INI::: Ml 
Db 7114 IIPPKKPPTKWPRKEPPAKVPEVPKKIWEEKVRVPEE PRVPP 7157 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSP 978 

: : : : I I I I II M I : : I 

Db 7158 TKVPEVLPPK — EWPEKKVPVPPAKKPEAP 718 6 



RESULT 8 
T13564 

microtubule-associated protein homolog - fruit fly (Drosophila melanogaster ) 
N; Alternate names: hypothetical protein EG:4 9E4.1 
C; Species: Drosophila melanogaster 

C;Date: 13-Aug-1999 #sequence_revision 13-Aug-1999 #text_change 17-Nov-2000 
C; Accession: T13564 

R;Spanos, L.; Papagiannakis , G. ; Siden-Kiamos r I.; Louis, C. 
submitted to the EMBL Data Library, April 1999 

A; Description: Sequencing the distal X chromosome of Drosophila melanogaster. 
A;Reference number: Z17689 
A; Accession: T13564 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-5327 <SPA> 

A/Cross-references: EMBL :AL03 1128 ; PIDN: CAA2 000 6. 1 
C; Genetics: 

A; Cross-references : FlyBase : FBgn0025392 

A;Introns: 24/2; 52/3; 104/3; 179/1; 232/1; 1669/3; 2566/1; 4798/3; 5272/1 
A;Note: EG:49E4.1 

C; Superf amily : Drosophila 576K microtubule-associated protein homolog 

Query Match 5.5%; Score 32 5.5; DB 2 ; Length 5327; 

Best Local Similarity 23.6%; Pred. No. 4.6e-05; 

Matches 275; Conservative 151; Mismatches 520; Indels 219; Gaps 53; 



Qy 


28 


VREPEDEEEEEEEEEE DEDEDLEELEVLERKPAAGLS AAPVPTAPAAG 

: : :| M 1 1 : : 1 1 1 : | | : : : II 1 : 
MEQVKDKEEHEQKIESGIITEKEAKKSASTPEEKETSDITSDDELPAQLADPTTVPPKSA 


75 


Db 


1277 


1336 


Qy 


76 


APLMDFGNDFVP PA PRGPL PAAP P V- AP ERQPSWDPSPVSS T VP AP S P L S AA 

II: II : 1 1 1 II 1 : 1 1 :: 1 1 1 1 
KDREDTGSIESPPTTEEAI EVEVQAKQEAQKPVPAPEEAI KTEKSPLASK-ETSRPESAT 


126 


Db 


1337 


1395 


Qy 


127 


: M I ||: M: Ml M 1 : 1 1 
GSVKEDTEQTKSKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKAES 


177 


Db 


1396 


1451 


Qy 


178 


RRGSSGAWXXXXKIMDLKEQPGNTISAGQE -DFPSVLLETAASXP-SLSPLSA 

I I 1 : M 1 1 : III II 1 1 : : 1 


229 



Db 1452 RRESIAKTHKDESSLDKAKEQESRRESIAESIKPESGIDEKSALASKEASRPESVTDKS- 1510 

Qy 230 ASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSS 289 

II : : : I I I I : I I I I : : I I I I : : 

Db 1511 KEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSWESVKDETEKSKEPSRRESIA 1567 

Qy 2 90 FSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNN 1 LHXQQELPTALTKLVKEDEW 346 

I I I | : I I : I III I I : : I : : : I I : : : 

Db 1568 ESAKPPIEFRE-VSRP-ESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIK 1625 

Qy 347 SSEKAK DSFNEKRVAVE-APMREEYADFKPFERVWEVKDS — KEDSDMLAAGGKI 398 

I : I I : : : I I I : I : : : I I I I : I I I : I I : 

Db 1626 STEKSRRESVAESFKADSTKDEKSPLTSKDIS-RPESAVENVMDAPFKETSRPESAVGSM 1684 

Qy 399 ESNLESKVDKKCFADSLEQTNHE — KDSESSNDDTSFP-STPEGIKDRSGAYITCAP FNP 455 

: || I : I I I ::: I I I I I I I I I I 

Db 1685 KDESMSK EPS RRES VKDGAAQ S RET S RPASVAES AKD — GADDLKELSRP 1732 

Qy 456 AATE SIATNI FPLLEDPTS ENXTD — EKKIEEKKAQIVTEKN — TSTK 499 

: I I I I I : I | : | | | | | : : | | I : I : 

Db 1733 ESTTQSKEAGSIKDEKSPLASEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKE 17 92 

Qy 500 TSNPFFVAAQDSETDYVTTDNLTKVTEEWANMP EGLTPDLVQEACESELNEVTGT 555 

I I I I : I I II I I I I I: : I 

Db 1793 ASRPASVA — ES I KDEAEKSKEESRRESVAEKSPLPSKEAS RPASVAES I KDE AE 1845 

Qy 556 KIAYETKMDLVQTSEVM — QESLYPAAQLCPSFEESEATPSPVLPDIVME-APLNSAVPS 612 

I I : : : I : : I : I I : : I : I : : I I : I I I I 

Db 1846 KSKEESRRESVAEKSPLPSKEASRPASVAES I KDEAEKSKEESRRESVAEKSPLPSKEAS 1905 

Qy 613 AGASVIQPSSS PLEAS SVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEI KEPENIN 669 

IN: II : : | | : | | : I I : I I I 

Db 1906 RPASVAES I KDEAEKSKEESRRESVAEKSPLPSKEAS RPASVAES I KDEAEKSKEESRRE 1965 

Qy 670 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 729 

: : : : I II II I : I : 1 I I I : I I 

Db 1966 SVAEKSPLP SKEASRPASVAESIKDEAEKSKEE SRRESVAEKSP 2009 

Qy 730 VDLFSDDSI PDVPQKQDETVMLVKESLTETS FESMI E — YENKEKLSALP-PEGGKP — Y 7 84 

: I I : | | | : : : : | | | : : I | I I : I 

Db 2010 LP SKEASRPASVAESIKDEAEKSKEE SRRESVAEKSP LP SKEASRPASV 2058 

Qy 785 LESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEE LSTAVYSNDDLFI SKEAQIRE 840 

III : : I : I I I I II : I II I : I I I I 

Db 2059 AESIKDEAEKSK EESRRESAAEKSPLPSKEASRPASVAESVKDEADKSKEESRR- 2112 

Qy 841 TETFSDSSPIEII — DEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCT 898 

I : : : I : I I : I I : : I : I I I I I : III: 

Db 2113 -ESMAESGKAQSIKGDQSPLKEVSRPES VAESVKDDPVKSK-EPSRRESVAGSVTAD 2167 

Qy 899 ELPHDLSLKNIQPKVEEK ISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE 949 

I :: I :| I I |: I I ::: 

Db 2168 SARDDQSPLESKGASRPESWDSVKDEAEKQES RRESK 2205 

Qy 950 IESIVKPKVLEKEAEKKL PSDTEKEDR-SPSAIFSADLGKTSWDLLYW RDI 1000 

I I : : I I : : : I : : : : I : I I I I : : I : : : I I 

Db 2206 TESVIPPKAKDDKSPKEVLQPVSMTETIREDADQPMKPSQAESRRESIAESIKASSPRDE 2265 



Qy 1001 KKTGWFGASLFLLLSLTVFSIVSWAYIALALLSWISFRIYKGVIQAIQKSDEGHPFR 1060 

I II II : : | : : | | I I I I I 

Db 2266 KSP LASKEASRPGSV AESIKYDLDKPQI IKDDKSTE-HSRR 2305 

Qy 1061 AYLESEVAI-SEELVQKYSNSALGH 1084 

I I : I : I I : I : : I I 
Db 2306 ESLEDKSAVTSEKSVSRPLSVASDH 2330 



RESULT 9 
A47282 

calcium-binding protein calphotin - fruit fly (Drosophila melanogaster ) 
C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 21-Jul-2000 
C; Accession : A472 82 

R;Martin, J.H. ; Benzer, S.; Rudnicka, M. ; Miller, C.A. 
Proc. Natl. Acad. Sci. U.S.A. 90, 1531-1535, 1993 

A; Title: Calphotin: a Drosophila photoreceptor cell calcium-binding protein. 

A; Reference number: A47282; MUID : 93165729 ; PMID: 8094559 

A;Accession: A47282 

A; Status : preliminary 

A;Molecule type: nucleic acid 

A; Residues: 1-8 65 <MAR> 

A;Cross-references: GB:L02111; NID:gl57031; PIDN : AAA2 8405 . 1 ; PID:gl57032 
A; Experimental source: photoreceptor cells 

A;Note: sequence extracted from NCBI backbone (NCBIN : 124 955 , NCBIP : 124 956) 

C; Genetics : 

A; Gene : FlyBase:Cpn 

A; Cross-references : FlyBase : FBgn0010218 

C; Superf amily : collagen alpha 1(1) chain; fibrillar collagen carboxyl-terminal 
homology; von Willebrand factor type C repeat homology 
C; Keywords: calcium binding 

Query Match 5.5%; Score 324.5; DB 2; Length 865; 

Best Local Similarity 22.3%; Pred. No. 4.1e-06; 

Matches 229; Conservative 128; Mismatches 375; Indels 297; Gaps 49; 

P AAG L S AAP VPT AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP 118 
I : I I I I I I I I : Mill: I I I I : I : 

PVSAPVAAPV-TPSAVAAPVQWSPAAVAPAPAAPIAVTPVAPPPTLASVQPATV — TI P 65 

APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAP APAAPP — ST 172 

I I : I : : I I : I : I II I I I I : I I : I I I I I I I I : I 

AP AP I AAAS VT P VAS VAP P WAAP T P P AA- S P VS T PVAVAQ I P VAVS AP VAP PVAAT 121 

PAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASF 232 
I : : I : I II I : I I I : I I : I = 



: I I | 1 : : | : : | : I 

VPANTT VPVAAPVAAVPAAVPWAPVLAP AV 194 



Qy 


59 


Db 


9 


Qy 


119 


Db 


66 


Qy 


173 


Db 


122 


Qy 


233 


Db 


161 


Qy 


293 


Db 


195 



1:11 I III : I : hi II 



Qy 348 SEKAKDSFNEKRVAVEAPMREEY7VDFKPFERWEVKDSKEDSDMIAAGGKIESNLESKVD 407 

: | : I I I I I I : : : I : : : I 

Db 235 T K P LAAAE PVWAP PAT E T PWAPAAAS PHVSVAPAVETAWAPVS 28 0 

Qy 4 08 KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFP 4 67 

II : : I : I I I II 
Db 281 ASTEPPV AAAT L T TAP ET PAL 301 

Qy 4 68 LLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

I : I I III II 
Db 3 02 APWAESQ VAA NT WAT P P 32 0 

Qy 52 8 VYANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA 57 9 

I | | : I : I : I II : I I I : I I I I : I I 

Db 321 TPAPEPETIAPPW AET P EVAS VAVA- ETT P PW — PPVAAESI-PAPWATTPV 371 

Qy 580 -AQLCPSFEESEAT PS PVLPDIVMEAPLNSAVP SAGAS VIQPSSS PLEAS SVNYESIKHE 638 

I I : : I : t I I : : : I : I I I : I I : I I : 

Db 372 PATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVLPPVAAEPVPAWAEET 431 

Qy 639 PENPPPYEEAMSV-SLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPA 697 

I I I I : : : : | : : I I : I I I : I : : I I I 

Db 4 32 PET PAPAS APVT I AALDI P EVAP VI AAP S DAPAEAP S AAAP I VST P PTTAS VPETTAP PA 491 

Qy 698 P DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPVDLFSDDSIP — DVPQK 744 

11111:111 I : I I : I I I : I : I 

Db 492 AVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDL 550 

Qy 745 QDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEV 804 

: : I : | : : : I : : I I I : : : : I 

Db 551 LEQTT S VPAVEAAEST S S P I P E TSLPPPNEAVASPEVAVAPITAPEPI PEPEP 603 

Qy 8 05 STLSKKEKIPLQ MEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLI 860 

I : I lh: ::| II I : II I I I I III : 
Db 604 S LAT P T E P I P VEAP WI Q EAVDAV EVPVTETST SIP-ETTVEFPEAV 64 9 

Qy 861 SSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP HDLSL 906 

: I I | : | : : : : | | | M I I I : I : : : 

Db 650 AEKV LDPAITEAPVTTQEPDVANINDGA PAT E I T T P AVE I VT AAAE VS D I AI 701 

Qy 907 KNIQPKVEEKISFSDDFSKNGSATSKVLL LP PDVSALG 944 

I I I ::|: : : : : ::|:: :| hll 

Db 702 PVIDPPVPQEIAVA-EIPETDTKPAEVIVEQSTIPIEAPVPEVSKYAEPVISEAPAAEVP 760 

Qy 945 HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLY 996 

: I I : | | : I I I : I : :: III I : I : II 
Db 761 ITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD — SVPVAK — ITPLL- 814 

Qy 997 WRDIKKTGV 1005 

II:: I I 
Db 815 -RDLQTTDV 822 



RESULT 10 
A47283 

calphotin - fruit fly (Drosophila melanogaster ) 



C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 25-Apr-1997 #text_change 21-Jul-2000 
C;Accession: A47283 

R;Ballinger, D.G.; Xue, N. ; Harshman, K.D. 

Proc. Natl. Acad. Sci. U.S.A. 90, 1536-1540, 1993 

A;Title: A Drosophila photoreceptor cell-specific protein, calphotin, binds 
calcium and contains a leucine zipper. 

A; Reference number: A47283; MUID : 93165730 ; PMID: 8434015 

A; Contents: photoreceptor cells 

A;Accession: A47283 

A; Status : preliminary 

A;Molecule type: nucleic acid 

A; Residues: 1-873 <BAL> 

A;Cross-references: GB:L05080; NID:gl57071; PIDN : AAA2842 0 . 1 ; PID:gl57072 
A;Note: sequence extracted from NCBI backbone (NCBIN : 124 958 , NCBIP : 124 959 ) 
C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl-terminal 
homology; von Willebrand factor type C repeat homology 

Query Match 5.4%; Score 320.5; DB 2; Length 873; 

Best Local Similarity 22.5%; Pred. No. 6e-06; 

Matches 235; Conservative 122; Mismatches 367; Indels 321; Gaps 50; 

Qy 59 P AAGL S AAP V- PT AP AAGAPLMD FGN D EVP P AP RG P L P AAP PVAPERQPSW-DPSP 112 

I : 11111:111 :: III Mill I I I I I : I 

Db 9 PVSAPVAAPVTPSAVAAPVQWSPAAVAVAPAWAPAPAAPTAVTPVAP — PPTLASVQP 66 

Qy 113 VSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPP A 163 

: I I I I I : I :: I I : I : I II I I I I I I I I I I 

Db 67 ATVTVPAPAPIAAASVAP VASVAPPWAAPTPPA AS P VS T P P VAVAQ I PVAV 118 

Qy 164 PAPAAPP STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLET — 217 

II I I I I I I I - I >l 

Db 119 SAPVAPPVAATPTPVAP - 1 PVAAPVI AT P PVAASAPT 154 

Qy 218 -AASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLT 276 

I I I : I I : I I : I I I I : : I : : I 

Db 155 PAAVTPWSPVIAT PPWPANTT VPVAAPVAAVPAAVPWAPVLA 199 

Qy 277 EFSELEYSEMGS S FSVS PKAESAVI VAN PREEI IVKNKDEEEKLVSNNI LHXQQE 331 

: I : I I : I I I III 
Db 200 P AVAPAV — APWAET PAP P PVAE I P VAT 226 

Qy 332 LPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDM 391 

: I : I : I I I : : I : I I II I I 

Db 227 IPECVAPLIPEVSWAT KPLAAAEPVWAP PATET PWAPAAASPH 272 

Qy 392 LAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCA 451 

:: : | : : : I || : : I :| I 
Db 27 3 VSVAPAVETAWAPVS ASTEPPV AAATLTTA 3 03 

Qy 452 PFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDS 511 

I I I I : I I Ml 
Db 304 PET PAL APWAESQ VAA 320 

Qy 512 ETDYVTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEV 571 

I I I I I : I : I : I II Mil : I I 

Db 321 NTWATPPTPAPEPETIAPPW AET P EVAS VAVA- ETT P P W — PPV 364 



Qy 572 MQESLYPA AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSS 622 

I I : I I II : : I : III:: : I : I I I : I 

Db 365 AAESI-PAPWATTPVPATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVL 423 

Qy 623 S PLEAS S WYES I KHEPENPPPYEEAMSV-SLKVSGIKEEIKEPENINAALQETEAPYIS 681 

I: I I : I I I I ::: :| : : I I : I I I :| 

Db 424 P PVAAE P VPAWAEET P ET PAPAS APVT I AALD I PEVAP VI AAP S DAPAEAP S AAAP I VS 483 

Qy 682 IACDLIKETKLSAEPAP DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPV 730 

: : I II 11111:111 I : I I : I 

Db 4 84 T P P T T AS VP ET TAP P AAVP T E P I D VS VL S E - AAI ET P VA P P VE VT T E VAVAD VAP P EAAE 542 

Qy 731 DLFSDDSIP — DVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESF 788 

II: I : I : : I : I : : : I : : I I I 

Db 543 DLIIEPVEPPAPIPDLLEQTTSVPAVEAAESTSSPIPE TSLPPPNEAVASPEV 595 

Qy 789 KLSLDNTKDTLLPDEVSTLSKKEKIPLQ MEELSTAVYSNDDLFISKEAQIRETETF 844 

: : : : I I : I II:: : : I II I : I I I 

Db 596 AVAPITAPEPIPEPEPSLATPTEPIPVEAPWIQEAVDAV EVPVTETST- 64 4 

Qy 845 SDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP — 901 

III III :: I I |: | : : ::|| Ml I I I : 
Db 645 — SIP-ETTVEFPEAVAEKV LDPAITEAPVTTQEPDVANINDGA PATEITTP 693 

Qy 902 HDLSLKNIQPKVEEKI SFSDDFSKNGSATSKVLL LP PDVS 941 

I : : |||::|::: : : : : I : : : I I : I I 

Db 694 AVEIVTAAAEVSDTAIPLIDPPVPQEIAVA-EIPETETKPAEVIVEQSTIPIEAPVPEVS 752 

Qy 942 ALG HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSA 98 0 

: I I : I I : I I I : I : : : III 
Db 753 KYAEPVISEAPAAEVPITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD 811 

Qy 981 I FSADLGKTSWDLLYWRDI KKTGV 1005 

I : I : II I I : : I I 
Db 812 --SVPVAK — ITPLL — RDLQTTDV 830 



RESULT 11 
T34418 

hypothetical protein F12F3.3 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 29-Oct-1999 

C;Accession: T34418 

R; Fulton, B. ; Wohldmann, P. 

submitted to the EMBL Data Library, July 1998 

A; Description: The sequence of C. elegans cosmid F12F3. 

A; Reference number: Z21521 

A; Access ion: T34418 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-3488 <FUL> 

A;Cross-references: EMBL : U8 0022 ; PIDN : AAC25885 . 1 ; GSPDB : GN00023; CESP:F12F3.3 

A; Experimental source: strain Bristol N2; clone F12F3 

C; Genetics: 

A; Gene: CESP:F12F3.3 

A;Map position: 5 



A;Introns: 281/3; 332/1; 562/3; 600/3; 1866/3; 1944/3; 3393/1 



Query Match 5.1%; Score 299.5; DB 2; Length 3488; 

Best Local Similarity 20.6%; Pred. No. 0.0003; 

Matches 272; Conservative 187; Mismatches 475; Indels 387; Gaps 57; 

Qy 1 MEDLDQSPLV — SSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEV 54 

: : : : | | : : I : : I I : : : : I I h | | : : I : : I 

Db 340 VDEVDDSTVLEEKKDDGDDKSKPKTKKKIIKKKETPESEQVTAAEPEQQKISEVDVQSVA 399 

Qy 55 LERKPAA GLSAAPVPT APAAGA PLMD FGNDF 85 

: : I I I III: II II 

Db 400 ETEVGAKKKPDAEKPTDLSKAKKDSKSKKSDEPEASTEEKSTTEKPTNDKTSKKSAEKKT 459 

Qy 86 VPPAPR GPLPAAPPVAPERQPSWDPSPVSSTVPA PSPLSAAAV 128 

II I I I I I : : I I I : I I : I : 

Db 460 VKPKKEVTGKPLEAKKPVEDKKDASQPSSSKESSPPTDGKKKKQIPKALFIPDEISSRFG 519 

Qy 12 9 SPSKL PEDDEPPARPP — PPPPASVS PQAEPVWTPPAP 164 

||: : 1:111111 : | | : : 

Db 520 DPSTMHSETNITTTIRGREGSADAKTPLVEPLSASVSMKVFTLVESAKEKAEFSFKRRSE 57 9 

Qy 165 AP AAPPSTPAAPK RRGSSGAWXXXXKIMD LKEQPGN 201 

I ||: : | : I: I:: I :| ::M : 

Db 580 TPDDKSRKKEGLPPAKKSEKKDEVTAEKQSTE7VLIESKKKEVDESKISEQQPSDKNKSEV 639 

Qy 202 TISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVL PTEGTLQEN 254 

: I I I I I : : I : I I I : : : : 

Db 640 VGVPEKAAGPETKKDV — SEIEEVPKKKTIKKKTEKSDSSI SQKSNVLKPADDDKSKSDD 697 

Qy 255 VSEASKEVSEKAKTLLID RDLTEFSELE YSEMGSSFSVSP 294 

I : : M : : I : I I I : I I : I I I : I 

Db 698 WDKSKKTTEDQTKVATDSKLEKAADTTKQIETETWDDKSKKKVLKKKTEKSDSF-ISQ 756 

Qy 295 KAESAVIV — ANPRE EI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKED 343 

|:|: :| II :| || :::| I :|: ::| I |: I 
Db 757 KSETPPWEPTKPAESEAQKIAEVNKAKKQKEVDDNL KREAEVAAKKI ADEKLKI EA 813 

Qy 344 — E WS S E KAKD S FNE K RVAVEAPMRE E YAD FK P F E RVW EVK D S K E D S DMLAAG 395 

|| :::| |: : :: I :: |: |: ::| : I I 
Db 814 EANIKKTAEVEAAKKQKEKDEQLKLETEWSKKSAAEKLELEKQAQIKKAAEAD AVK 870 

Qy 396 GKIESNLESKVD — KKCFAD — SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCA 451 

: I I : : I : : I I I I I I : : I : I : I I I : : I 

Db 871 KQKELNEKNKLEAAKKSAADKLKLEEESAAKSKKVSEESVKF GEEKKTKAGEKTVQV 927 

Qy 452 PFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDS 511 

I : : : I I II I I : I I I : I I I I I : I : : I I 

Db 928 ESEPTSKKTIDTKDVGATE-PADE--TPKKKIIKKK TEKSDSS ISQKS 972 

Qy 512 ETDYWTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTK IAYETKMDL 565 

M : : : : I I : I : I I I : I : : I I I I : I I : 

Db 973 ATD SEKVSKQKEQDEPTKPAVSETQMVTEADKSKKQKETDEKLKLDAEIAAKTKQEA 1029 

Qy 566 VQTSEV-MQESLYPAAQLCPSFEESEATPSPVLPD-IVMEAPLNSAVPSAGASVIQPSSS 623 

: | : : I I : : hi | | : : | : : : I I : : : 

Db 1030 DEKSKLDAQEKIKKVS EDDAARKEKELNDKLKLESEIATKKASADKLKLEEQAQ 1083 



Qy 624 PLEASSVNYESIKHEPENPPPYE-EAMSVSLKVSGIKEEIKEPENINAALQETEAPYISI 682 

:|: | |: | : I : : : I I : I I::: II I : 

Db 1084 AKKAAEV— EAAKKQKEKDEQLKLDTEAASKKAAAEKLELEKQAQIKKA AGADAVKK 1138 

Qy 683 ACDLIKETKLSAEPAPDFS DYSEMAKVEQPVPDHSEL VE 721 

:| :: | | I : || :| I : ::| I 

Db 1139 QKELDEKNKLEANKKSAAGKLKIEEESAAKSKQTVEEQAKLDAQTKAKTAEKQTKLEKDE 1198 

Qy 722 DSSPDSEPVDLF SDDSI PDVP 742 

I : : I I : I I I I I 

Db 1199 KSTKESESKETVDEKPKKKVLKKKTEKSDSSISQKSETSKTWESAGPSESETQKVADAA 1258 

Qy 743 QKQDET VMLVKESLTETSFESMIEYENKEKLSA LPP 778 

: I I I I : I : t I : I : I I : I I : I I 
Db 1259 RKQKETDEKQKLEAEITAKKSADE KSKLEAESKLKKAAEVEAAKKQKEKDEQLKLDT 1315 

Qy 779 EGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKIPLQME 818 

I : I I I : I I : I : I I : I : : 

Db 1316 EAASKKAAAEKLELEKQSHIKKAAEVDAVKKQKELEEKQRLESEAATKKADAEKLKLEEQ 1375 

Qy 819 ELSTAVYS NDDLFISKEAQIRETETFS DSSPIEIIDEFP 857 

: | : I : I I I I : I I I : : I I : I I I 

Db 1376 KKKAAEIALIEIQKEQEKLAQEQSRLEDEAKKSAEKQKLESETKSKQTEEAPKESVDEKP 1435 

Qy 858 — TLISSKTDSFSKLAREYTDLEVSHKSEIANAP-DGAGSLPCTELPHDLSLKNIQPKVE 914 

: : I I I : I : I II : I : I I = I I : I I III 
Db 1436 KKKVLKKKT EKSDSSISQKSKSAKSTVDAAETL ESDFNLVEKKTVQKVE 1484 

Qy 915 EKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVK 955 

i : I I I I : : I III : : I I 

Db 1485 QSPDESTSATIKRDPAQKTEEISKQDDGDEKKTTTDGKPPKPEDSEA TPKKRWK 1539 

Qy 956 PK VLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWD 993 

I I I I : I I I I I I I I I I I = 

Db 154 0 KKTQKS D SVAS DAS LADVSKLSDDVEEKP KKKVLKKKT EKSDSVISETSSVDT I KPESVE 1599 

Qy 994 L 994 

Db 1600 I 1600 



RESULT 12 
S37431 

ankyrin 2, neuronal long splice form - human 

N;Alternate names: ankyrin B, 440K splice form; ankyrin-B; brain ankyrin; non- 
erythroid ankyrin 

N; Contains: ankyrin 2, short form 
C; Species: Homo sapiens (man) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 13-Aug-1999 
C;Accession: S37431; A39643; B39643; A40334; A49462; S14533; S14569 
R;Chan, W. 

submitted to the EMBL Data Library, September 1993 
A; Reference number: S37431 
A; Access ion: S3 7 431 
A; Status : preliminary 
A; Molecule type: mRNA 



A; Residues: 1-3924 <CHA> 

A;Cross-references: EMBL:Z26634; NID:g406287; PIDN : CAA813 87 . 1 ; PID:g406288 
R;Otto, E.; Kunimoto, M. ; McLaughlin, T . ; Bennett, V. 
J. Cell Biol. 114, 241-253, 1991 

A; Title: Isolation and characterization of cDNAs encoding human brain ankyrins 

reveal a family of alternatively spliced genes. 

A; Reference number: A39643; MUID : 913024 66 ; PMID: 1830053 

A; Accession : A3 9 64 3 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-2077 <0T1> 

A; Cross-references : GB: X56957 

A;Accession: B39643 

A; Status: preliminary 

A;Molecule type: mRNA 

A;Residues: 1-1443,3585-3924 <OTT> 

A;Cross-references : EMBL:X56958 

R; Tse, W.T.; Menninger, J.C.; Yang-Feng, T.L.; Francke, U.; Sahr, K.E.; Lux, 
S.E.; Ward, D.C.; Forget, B.G. 
Genomics 10, 858-866, 1991 

A;Title: Isolation and chromosomal localization of a novel nonerythroid ankyrii 
gene . 

A; Reference number: A40334; MUID : 9200992 1 ; PMID: 1833308 
A;Accession: A40334 
A; Molecule type: DNA 

A;Residues: 4 63-474 ,' PE 1 , 477-4 95 <TSE> 

A;Cross-references: GB:M37123; NID : gl78647 ; PIDN : AAA62828 . 1 ; PID:gl78648 
R;Chan, W. ; Kordeli, E. ; Bennett, V. 
J. Cell Biol. 123, 1463-1473, 1993 

A;Title: 440-kD ankyrinB: structure of the major developmentally regulated 
domain and selective localization in unmyelinated axons. 
A;Reference number: A49462; MUID : 94075409 ; PMID: 8253844 
A; Accession : A49462 

A; Status: preliminary; nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-3924 <RES> 

A;Cross-references: EMBL:Z26634; NID:g406287; PIDN : CAA81387 . 1 ; PID:g406288 

C; Genetics: 

A; Gene: GDB : ANK2 

A;Cross-references: GDB: 127607; OMIM: 106410 
A;Map position: 4q25-4q27 

C; Superf amily : ankyrin; ankyrin repeat homology 
C; Keywords: alternative splicing 

F;2-3924/Product : ankyrin 2, long form #status predicted <MAT> 

F;2-1443, 3585-3924/Product : ankyrin 2, short form {(status predicted <MA2> 

F; 63-95/Domain : ankyrin repeat homology <AN01> 

F; 96-12 8 /Domain : ankyrin repeat homology <AN02> 

F; 129-161/Domain : ankyrin repeat homology <AN03> 

F; 162-190/Domain: ankyrin repeat homology <AN04> 

F; 19 1-22 3 /Domain : ankyrin repeat homology <AN05> 

F;232-264/Domain: ankyrin repeat homology <AN06> 

F; 2 65-2 97 /Domain : ankyrin repeat homology <AN07> 

F;298-330/Domain: ankyrin repeat homology <AN08> 

F;331-363/Domain: ankyrin repeat homology <AN09> 

F;364-396/Domain: ankyrin repeat homology <AN10> 

F; 397-42 9/ Domain : ankyrin repeat homology <AN11> 

F; 4 3 0-4 62 /Domain : ankyrin repeat homology <AN12> 



F; 463-4 95 /Domain : ankyrin repeat homology <AN13> 

F; 4 96-52 8 /Domain : ankyrin repeat homology <AN14> 

F;529-561/Domain: ankyrin repeat homology <AN15> 

F; 562-5 94 /Domain : ankyrin repeat homology <AN16> 

F; 5 9 5- 62 7 /Domain : ankyrin repeat homology <AN17> 

F; 62 8- 6 60 /Domain : ankyrin repeat homology <AN18> 

F; 661- 6 93 /Domain : ankyrin repeat homology <AN19> 

F; 694-72 6/ Domain : ankyrin repeat homology <AN20> 

F; 72 7-7 5 9 /Domain : ankyrin repeat homology <AN21> 

F; 760-7 92 /Domain : ankyrin repeat homology <AN22> 

F; 793-825/Domain : ankyrin repeat homology <AN23> 

Query Match 5.0%; Score 2 99; DB 2; Length 3924; 

Best Local Similarity 21.5%; Pred. No. 0.00037; 

Matches 258; Conservative 167; Mismatches 417; Indels 358; Gaps 62; 

Qy 14 DSPP RPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLE ELEVLERK 5 8 

ill : I I | :::::::: | | : | | II 

Db 1648 DIPPDETQSTQKQHKPSLGIKKPVRRKLKEKQKQKEEGLQASAEKAELKKGSSEESLGED 1707 

Qy 59 P AAG L S AAP VP T AP AAGAP LMD FGNDFVPPAPRG 92 

I I I : I : I I I : I I : : I : I =1 

Db 1708 P — GLAPEPLPTVKAT-SPLIEETPIGSIKDKVKALQKRVEDEQKGRSKLPIRVKGKEDV 1764 

Qy 93 PLPAA-PPVAPERQPSWDPSP VSSTVPAPSPL 123 

I I I I I : I I III II M: 

Db 1765 PKKTTHRPHPAASPSLKSERHAPGSPSPKTERHSTLSSSAKTERHPPVSPSSKTEKHSPV 1824 

Qy 124 SAAA VSPSKLPEDDEPPA RPPPPPPASVSPQAEPVW 159 

I : I I I I I I I : | M | : : : ! | 

Db 1825 SPSAKTERHSPASSSSKTEKHSPVSPSTKTERHSPVSSTKTERHPPVSPSGKTDKRPPV- 18 83 

Qy 160 T P PAP AP AAP P S T P AAP KRR GSSGAVVXXXXKIMDLKEQPGNTISAGQEDFPSVLLE 216 

:| I I :| ::| II I I I :| :: I 

Db 18 84 SPSGRTEKHPPVSPGRTEKRLPVSPSGRTD KHQPVSTAGKTEKHLPVSPSG 1934 

Qy 217 TAASXPSLSPLSAAS-FKEHEYLGNLSTVLPT EGTLQENVSEASKEVSEKAKT 2 68 

I : I I I : I : I : : I I : I I : I I I 

Db 1935 KTEKQPPVSPTSKTERIEETMSVRELMKAFQSGQDPSKHKTGLFEHKSAKQKQPQEKGKV 1994 

Qy 269 LLIDRDLTEFSELEYSEMGSSFSVSPKAES — AVIVANPREEI IVKNKDE — 316 

: I I : : : I : : I I : III I I : : I I I : I 

Db 1995 RVEKEKGPILTQRE-AQKTENQTIKRGQRLPVTGTAESKRGVRVSS IGVKKEDAAG 2049 

Qy 317 -EEKLVSNNI LHXQQELPTALTKLVKE DEW 346 

: I I : : I : I I : I : I hi I I : 

Db 2050 GKEKVLSHKIPEPVQSVPEEESHRESEVPK — EKMADEQGDMDLQISPDRKTSTDFSEVI 2107 

Qy 347 SSE-KAKDSFNEKRVAVEAPMREEYAD FKPFERVWEVKDSKED 388 

I : I : : I :: I : : I II : : I :: 

Db 2108 KQELEDNDKYQQFRLSEETEKAQLHLDQVLTSPFNTTFPLDYMKDEFLPALSLQSGALDG 2167 

Qy 389 -SDMLAAGGKIESNLESKVDKKCFADSLEQTNHE KDSESSNDDTSFPSTPEGIKDRS 444 

1:1 I I I : : II II : | : | : I I : I : : : : : 

Db 2168 SSESLKNEGVAGSPCGSLMEGTPQISSEESYKHEGLAETPETSPESLSF--SPKKSEEQT 2225 



Qy 



445 GAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPF 504 



I : II | | I ::: I I I : I : : I I I :: 

Db 2226 G ETKESTKTETTTEIRSEKEHPTTKDITGGS— EERGATVTEDSETSTESFQKE 2277 

Qy 505 FVAAQDSETDYWTDNLTKVTEEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKM 563 

: | | : I : I I I I I : I I I I : I : : I 

Db 227 8 ATLGSPKDTSPKRQDDCTGSCSVALAKETPTGLT EEAACDEGQRTFGSS-AHKT-- 233 0 

Qy 564 DLVQTSEVMQESLYPAAQLCPSFEESEATPSPVL PDIVMEAP — LNSAV 610 

I I Ml : : | : : | | I I I : : I I I : 

Db 2331 QTDSEAQES TATSDETKALPLPEASVKTDTGTESKPQGVIRSPQGLELAL 2380 

Qy 611 PSAGASVIQ PSSSPLEASSVNYESIKHE-PEN — PPPYEEA-MSVSLKVSG 657 

|| : | : Mill : : I : I : : I I : I : M : I 

Db 2381 P S RD S EVLSAVADDS LAVS HKDS LEAS PVLEDNSSHKTPDSLEPSPLKESPCRDSLESSP 244 0 

Qy 658 IKEEIKE PEN--INAALQETE--APYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

:: ::| | : : ||: :|| |: |::: II I :| :| 

Db 2441 VEPKMKAGIFPSHFPLPAAVAKTELLTEVASVRSRLLRD PDGS--AEDDSLE 2490 

Qy 711 QPVPDHSELVEDS SPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I : |:| I Ml: : I : I :: I I 
Db 2491 Q TSLMESSGKSPLSPDTPSSEEVSYEVTPKTTDVSTPKPAVIHECAEED 2539 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKIPLQMEELS 821 

: | I M II MM I I : I I : = I = : I I I I I 

Db 2540 DSENGEKKRFTPEE EMFKMV TKIKMF-DELEQEAKQKRDYKKEPKQEESSS 2589 

Qy 822 TAVYSNDDLFISKEA-QIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 880 

: |:| | : : : | : I I : I I : : M : I : 
Db 2590 S SDPDADCSVDVDEPKHTGSGEDESGV PVLVTSESRKVSSSSES 2633 

Qy 881 HKSEIANAPDGAGS LPCTELPHDLSLKN IQPKVEEKISF SDD 922 

: | : I Ml I : I I : : | | | : : : | : : I 

Db 2634 -EPELAQLKKGADSGLLPEPVIRVQPPSPLPSSMDSNSSPEEVQFQPWS KQYTFKMNED 2692 

Qy 923 FSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSD TEKEDRS 977 

: : : I MM I I I I I I I I 

Db 2693 TQEEPGKSEE EKDSESHLAEDRHAVSTEAEDRS 2725 



RESULT 13 
T16251 

hypothetical protein F35A5.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 18-Feb-2000 
C;Accession: T16251 
R;Leimbach, D. 

submitted to the EMBL Data Library, January 1996 

A; Description: The sequence of C. elegans cosmid F35A5. 

A; Reference number: Z18485 

A;Accession: T16251 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-1274 <LEI> 

A;Cross-references: EMBL:U46675; NID : gll66613 ; PID: gll66621 ; PIDN: AAB52641 . 1; 
GSPDB:GN00028; CESP:F35A5.1 

A; Experimental source: strain Bristol N2 ; clone F35A5 



C; Genetics: 
A; Gene: CESP:F35A5.1 
A;Map position: X 
A;Introns: 1272/2 

Query Match 4.9%; Score 290.5; DB 2; Length 1274; 

Best Local Similarity 20.5%; Pred. No. 0.00017; 

Matches 221; Conservative 126; Mismatches 417; Indels 313; Gaps 45; 

Qy 13 SDSPPRP — QPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPT 70 

|:||| III:: I I I I I I I I I : I I : I I : II 

Db 2 SRAPPTPIKNPAKKWKPPWESVDEEEEME VDEETPAPSKLEKKPSLKRKDAPTKP 56 

Qy 71 APAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSP 130 

I : I I I | : | | | : | | I : II I 
Db 57 VPSPGAP SPVPIKNPVKKWKAPWEDDEPMEEAPAAP VP 94 

Qy 131 SKLPEDDEP PARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWX 187 

: I II I I : I I : : I I I I I I I : 

Db 95 AKKVRDPSPKKVPAKPRDASPKKIMAAKK EPETLPAVPP-TPVKNPVKKFKAPWED 14 9 

Qy 188 XXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXP- SLSPLSAASFKEHEYLGNLSTVLP 246 

: I : I : I I : I : I I : I I I I I I I I : : II 
Db 150 DEVDVEDVKDAP — TVPAKKTPVLKKKEPAAAAKPRDPSPKKAAPSKEHDPI VPP 202 

Qy 247 TEGTLQENVSEASKEVSEKAKTLLIDRDL — TEFSELE YSEMGSSFSVSPKA 296 

I I : : I I | : : | I I : I I I I : 

Db 2 03 T PIKNPAKKWKPPWEDDEVPTEEIKEPEPATRKVPALKKKEPSTSVKPVS 252 

Qy 297 ESAVIVANPREEIIVKNKDE EEKLVSNNILHXQQELPTALTKL 339 

: :| ::: | | : | |:: : : :| : 

Db 253 D PSPTKKVPVKKEPEVPPTPIKNPTKKWKPPWEDETPVEEV — KEPPVPEKKAPV 305 

Qy 340 VKEDEWSSEKAKDSFNEKR VAVEAPMREEYADFKPFERVWEVKDSKEDS 38 9 

: | : : : I I : I I I I : : : I I I I I I : 

Db 306 LKKKD P AP AAKARD P S P S KAAP KKVE P S S P WP PT P VKN P VKK YKP PWEVDDEPAE- 361 

Qy 390 DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYIT 44 9 

I I : I I I : : I I : : I I : I I : 

Db 362 EVKKPSAPEKK— TPVLKRKEPEPSSTTPSSDPSPKKAAPAVKPRDSSPKK 410 

Qy 450 CAPF — NPAATESIATNIFPLLEDPTSE NXTDEKKIEE-KKAQIVTEKNTSTKTS 501 

I : | | I I : : : I : I I : I I I : : : I I 

Db 411 AT P LQ AD P KAQ EVP P T P V KNPVKKYKP PWEVDDEDPVEEVKQPEAPAKKT PVLKRK 4 66 

Qy 502 NPFFVAAQDSETDYVTTDNLTKVTEEWANMPE GLTPDLVQ 542 

I II : I I III II M I 

Db 4 67 EP AAKD TAKPATSKTPET PEKKDPVKPRDSSPKKVAAKPDSAQAPAT 513 

Qy 543 EACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEES 58 9 

I I : :: I : I : I : : I I I I I : 

Db 514 PVKNPVKKWRPPWEDDETPADDVSKPTDAKKTPSLAKKDPAPAKESLKPKADTKAPAKPR 573 

Qy 590 EATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYEEAM 649 

: : I I I I I : I I I I : I : I I = 

Db 574 DPSPKKVAP TAPEKKT PVLAKKEPAGPADSKTKEPEKSKPRDPSPKKAVP 623 



Qy 650 S VSLKVSGIK— EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 7 02 

: : : : I II :|:: I :| : I I 
Db 624 AKPVPKTEVAPAAVKKPEPISKPKDTAPKKAEPNSPW -PPTP 665 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 7 62 

I: II II : : I I I |: : :| :| |:: |: I 

Db 666 VKNPVKKWKPPWEDDDAPAKPVSL PEPEKKTPVLAKKAPTKPDSE 710 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: I I I 

Db 711 AAADPVSGP 719 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEII DEFPTLISSKTDSFSKLAREYT 875 

I : I : : I : I : : III::: II : I : =1 
Db 720 SSKDPKLAKKAPVKP RD P S PMKAVP I KP AP KT EVP PAWKK P E P VAK 7 66 

Qy 876 DLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVE EKI SFSDDFSKNGSA 92 9 

: I I II : : I I I : I : I I I : : : I 

Db 767 SRDPSPKK — AKAEPNSPWPPT — PVKNPVKKWKPPWEDDDAPAEPWVPEPEKKTPVL 822 

Qy 930 TSKVLLLPPDVS ALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSP 978 

I : I I I II | : | | : : I I II : I I I 
Db 823 AKKTPVKPRDPSPKKAVPAKPSTKTDAPPVSVKKPEPVSKPKE PSPKKAEPNSP 876 



RESULT 14 
A56577 

microtubule-associated protein MAP IB - rat (fragment) 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 21-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 16-Feb-1997 
C;Accession: A56577 

R;Zauner, W. ; Kratz, J.; Staunton, J.; Feick, P.; Wiche, G. 
Eur. J. Cell Biol. 57, 66-74, 1992 

A; Title: Identification of two distinct microtubule binding domains on 
recombinant rat MAP IB. 

A; Reference number: A56577; MUID : 92347374 ; PMID: 1639092 

A;Accession: A56577 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-2364 <ZAU> 

A; Cross-references : GB:X60550 

A; Experimental source: brain 

A;Note: nucleotide sequence not given; conceptual translation not complete 
C; Superf amily : microtubule-associated protein MAP1B 

Query Match 4.9%; Score 289; DB 2; Length 2364; 

Best Local Similarity 20.9%; Pred. No. 0.00047; 

Matches 275; Conservative 158; Mismatches 462; Indels 422; Gaps 61; 

Qy 30 EPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPA 8 9 

It I II I I I I : : : II : : I I I I I I : > 1 : 
Db 909 EAEQSEEEGEEEEDKAEDAREEDHEPDKTEAEDYVMAWDKAAEAGVTEDQY — DFL 9 63 

Qy 90 PRGPLPAAPP--VAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 147 

III : I I : I : I I : I I II I : I I 

Db 964 GTPAKQPGVQSPSREPA — SSIHDETLPGGSESEATA SDEENREDQPEEF 1011 



Qy 148 PASVS PQAEPVWTP PAPAPAAP PST PAAPKRRGS S GAWXXXXKIMDLKEQPGNT I SAGQ 2 07 

I : : II III : I : III 

Db 1012 TATSGYTQSTIEISSEPTPMDEMSTP RDVMTDETNNEETESPSQ 1055 

Qy 208 E DFPSVLLETAASXP SLSPLSAASFKE HEYLGNLSTVLPTEGTLQENV 255 

I : I I II I : I I I : : I : I I : I :: 

Db . 1056 EFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTISPPSSMEEDKF 1115 

Qy 256 SEAS KEVSEK AKTLLIDRDLTE 277 

| : : : I : I I : : I I I : I : 

Db 1116 SKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-N 1174 



Qy 278 FS ELEYSEMGS S FS- VS PKAESAVI VAN PRE EIIVKNK 314 

II I : : I I : : I I I I I : hi I I : : :: 

Db 1175 FSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYY 1234 



Qy 315 DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSE — KAKDS FNEKRVAVEAPMRE 368 

||: : : I : I III : I I I I I : : : I I I 

Db 1235 QSPTDEKSSHLPTEVTENAQAVP VS FEFTEAKDE-NER — SSISPMDE 1279 

Qy 369 EYADFK-PFERVWE VKDSKED SDMLAAGGKIESNLESKVDKKCFADSL 415 

I : I I : I : : I I : I I I : I I II I : I : I 

Db 1280 PVPDSESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSD — 1337 

Qy 416 EQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNP AATES IATNI FPLLED- 471 

I : I I I I I : : I : I hi h I : : II 

Db 1338 KESPVS-DLTSDLYQDKQEEKRAGFIPIKEDFSPEKKAS DAEIMSSQSALALDE 1390 



Qy 472 PTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

II:: : I : : I I I : : I h h I : : 
Db 1391 RKLGGDGSPTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT YSHMEGVAS 1447 

Qy 524 VTEEWA — NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAA 580 

I: II : II III : : h h : : : : = h 

Db 1448 VSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSK 1506 

Qy 581 QLCP S FEES EAT PSPVLPDIVMEAPLNSAV P 611 

: I I :: : | : : | : | : | I 

Db 1507 EECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHP 1566 

Qy 612 SAGASVIQ PSSSPLEASSVNYESIKHE— PENPPPY EEAMSVS-LKVSG 657 

: I I : : h | : | : |: I I I I : I I I : : I 

Db 1567 TVGAGMLHITENGPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLSELISVSQVEASP 1626 



Qy 658 IKEEIKEPENINAALQETE APYI SIACDLIKE TKLSAEP 696 

I I : III : | |:| : :: III : 

Db 1627 STSSAHTPSQIASPLQEDTLSDWPPRDMSLYAS LASEKVQSLEGEKLSPKSDISPLTPR 1686 

Qy 697 APDFSDYSEMAK VEQPVPDHSELVEDS 723 

: I I I I : I I : I I : : h 

Db 1687 ESSPTYSPGFSDSTSGAKESTAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQHHLALSRD 1746 

Qy 724 SPDSEPVDLFSDD SIPDVP — --QKQD 746 

Mill!: || : I : 

Db 1747 LTTSSVEKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYESHEKTIQAHDVGGYYYEKTE 1806 



Qy 



747 ETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKD 7 97 



■ • i i • i i i i - i i i i i : : : 

Db 1807 RTIKSPCDSGYSYETIEKTTKTPEDGGYSCEITEKTTRTPEEGGYSY EISEK 1858 

QY 798 TLLPDEVS--TLSKKEKIPLQMEELSTAVYSNDD LFISKEA 836 

I III I II: ::::| :| | |: 

Db 1859 TTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESES 1918 

QY 837 QIRETETFSDSSP IEI IDEFPTLISSKTDSFSKLAREYT DL 877 

Ml: II I ::: : I |: I II || 

Db 1919 YSYETTTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTPERKSPSEARQDVDL 1978 

Qy 878 EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNIQP KV 913 

I I MM: : | : | | | | I : | : 

Db 1979 CLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESERPLTQSGGAPPPSGGKQQGRQC 2038 

QY 914 EEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKK-LPSD-- 970 

: I I I I I II:" IN | : | | : : | : | 

Db 2039 DETPPTSVSESAPSQTDSDV PPETE ECPSITADANLDSEDESETIPTDKT 2088 

Qy 971 TEKEDRSPS AIFSADLGKTSWDLLYWRDIKKTG 1004 

MINI : | | | | | Ml 

Db 2 08 9 VTYKHMDPPPAPMQDRSPSPRHPDVSMVDPEALAIEQNLGKALKKDLKEKAKTKKPG 214 5 



RESULT 15 
QRMSP1 

microtubule-associated protein MAP1B - mouse 

N; Alternate names: microtubule-associated protein MAPI (X) ; microtubule- 
associated protein MAPI. 2; microtubule-associated protein MAPS 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change Ol-Sep-2000 
C;Accession: S07549; S44387; A33645 
R;Noble, M. ; Lewis, S.A.; Cowan, N.J. 
J. Cell Biol. 109, 3367-3376, 1989 

A; Title: The microtubule binding domain of microtubule-associated protein MAPlB 
contains a repeated sequence motif unrelated to that of MAP 2 and tau. 
A;Reference number: A33645; MUID : 90094539 ; PMID:2480963 
A;Accession: S07549 
A;Molecule type: mRNA 
A; Residues: 1-2464 <NOB> 

A;Cross-references: EMBL:X51396; NID:g52999; PIDN : CAA35761 . 1 ; PID:g53000 
R;Sanchez, C; Padilla, R. ; Paciucci, R. ; Zabala, J.C.; Avila, J. 
Arch. Biochem. Biophys . 310, 428-432, 1994 

A; Title: Binding of heat-shock protein 70 (hsp70) to tubulin. 

A; Reference number: S44387; MUID : 94234720 ; PMID: 8179328 

A; Accession: S44387 

A; Status : preliminary 

A;Molecule type: protein 

A;Residues: 653-663, 'IC <SAN> 

C; Superf amily : microtubule-associated protein MAP IB 

C; Keywords: microtubule binding; phosphoprotein; tandem repeat 

F; 5 8 9- 7 8 6/ Domain : microtubule binding #status experimental <MTB> 

F; 589-592, 639-642, 64 9-652, 655-658, 660-663, 668-671, 674-677, 679-682,683-68 6, 687- 

690, 691-694,695-698, 699-702,708-711,712-715,716-719,72 0-723,727-730,758-761, 7 64- 

767, 783-786/Region: 4-residue repeats (K/R-K-E/D-X) 

F; 1861-2064/Region: 17-residue repeats 



F; 91, 116, 351, 888, 1124, 1153, 1168, 12 08, 1662, 1877, 1918, 2 003, 2030, 2 054, 2083/Binding 
site: phosphate (Ser) (covalent) #status predicted 

F; 147, 969, 1336, 1562, 1563, 1702, 1708, 1990, 2057, 2063, 2419/Binding site: phosphate 
(Thr) (covalent) #status predicted 

F; 1953/Binding site: phosphate (Tyr) (covalent) #status predicted 

Query Match 4.9%; Score 289; DB 1; Length 2464; 

Best Local Similarity 19.8%; Pred. No. 0.0005; 

Matches 262; Conservative 148; Mismatches 474; Indels 442; Gaps 55; 



Qy 


32 


EDEEEEEEEEEEDEDEDL-EELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAP 
1 1 : 1 1 1 1 1 1 1 : 1 1 II : : 1 1 1 I I : | 
EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAEAGVTEEQYGY 


90 


Db 


1009 


1061 


Qy 


91 


RGPLPAAPPV-APERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPA 


149 


Db 


1062 


1 I : : | | : | : 1 | : | | | | | ; 1 1 1 
LGTSAKQPGIQSPSREPA — SSIHDETLPGGSESEATA SDEENREDQPEEFTA 


1112 


Qy 


150 


SVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQED 
: : 1 1 1 II : 1 : 1 1 1 1 
TSGYTQSTIEISSEPTPMDEMSTP RDVMSDETNNEETES PSQEF 


209 


Db 


1113 


1156 


Qy 


210 


FPSVLLETAASXPSLSPLSAASFK EHEYLGNLSTVLPTEGTLQENVSE 

1 : : 1 : 1 1 1 : 1 : I I : I : : I : 
VNITKYESSLYSQEYSKPAVAS FNGLSEGSKTDATDGKDYNASASTISPPSSMEEDKFSK 


257 


Db 


1157 


1216 


Qy 


258 


AS KEVSEK AKTLLIDRDLTEFS 


279 


Db 


1217 


= : 1 : 1 1 : : 1 1 1 : 1 : 1 1 
SALRDAYCSEEKELKASAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-NFS 


1275 


Qy 


280 


ELEYSEMGSSFSVSPKAESAVI VANPRE EIIVKNK 

1 : : 1 1 : 1 1 1 1 I 1 : hi 1 1 : : : : 
LTPNEIKVSAEGEARSVSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYYQSP 


314 


Db 


1276 


1335 


Qy 


315 


-DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADF 

Mi : : 1 : 1 : : 1 1 1 1 1 : 1 : 1 1 1 1 
TDEKSSHLPTEVSENAQAVPVSF EFSEAKDE-NER — ASLSPMDEPVPDS 


373 


Db 


1336 


1382 


Qy 


374 


K-PFERVWE VKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSE 


425 


Db 


1383 


: 1 1 : 1 : 1 1 : 1 1 : | I 
ESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGR RSESPFE 


1425 


Qy 


426 


SSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNI FPLLEDPTSENXT 


478 


Db 


1426 


1 M : 1 : 1 1 : 1 1 : 1 1 1 1 
GKNGKQGFPDRESPVSD LTSTGLYQDKQEEKSTGFI PIKEDFGPEKKTSDVETMS 


1480 


Qy 


479 


DEKKI EEKKAQIVTEKNTSTKTSNPFFVAAQDSETD 

1 1 : 1 : : 1 : : | 1 | : : | I : | 


514 


Db 


1481 


SQSALALDERKLGGDVSPTQIDVSQFGSFKEDTKMSISEGTVSDKSATP — VDEGVAEDT 


1538 


Qy 


515 


YVTTDNLTKVTEEWA — NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVM 
1 : : 1 : II : 1 1 III : : I : I : : : : I I I 
YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 


572 


Db 


1539 


1597 


Qy 


573 


QES-LYPAAQLCP SFEESEATPSPVLPDIVMEAPLNSAV-- 


610 


Db 


1598 


II: : |: : II :: : | : : |:| :| 
QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 


1657 



Qy 611 PSAGASVIQ PSSSPLEASSVNYESIKHE — PENPPPY EEAM 64 9 

I : I I I I : I : : I : I : I II I : 

Db 1658 FSRQSPDHPTLGASVLHITENGPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLSELI 1717 

Qy 650 SVS-LKVSGIKEEIKEPENINAALQETE APYISIACDLIKE TKLSAE 695 

I I I : : I I I : II I : I I : I : : : l||: 

Db 1718 SVSQVEASPSTSSAHTPSQIASPLQEDTLSDWPPREMSLYASLASEKVQSLEGEKLSPK 1777 

Qy 696 P APDFSDYSEMAK VEQPVPDH S ELVED S - 723 

: I I I I : I I : I I : : I : 

Db 1778 SDISPLTPRESSPLYSPGFSDSTSAAKETAAAHQASSSPPIDAATAEPYGFRSSMLFDTM 1837 

Qy 724 SPDSEPVDLFSD DS 737 

I II I I I I 

Db 1838 QHHLALNRDLTTSSVEKDSGGKTPGDFNYAYQKPENAAGSPDEEDYDYESQEKTIRTHDV 1897 

Qy 738 I PDVPQKQDETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESF 788 

: : I : I : I I : I : I I I : I I I I I 
Db 1898 VRYYYEKTERTIKSPCDSGYSYETIEKTTKTPEDGGYTCEITEKTTRTPEEGGYSY 1953 

Qy 789 KLSLDNTKDTLLPDEVS--TLSKKEKIPLQMEELSTAVYSNDD 829 

: : : I III I I I : : : : : | : I 

Db 1954 EISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEK 2009 

Qy 830 — LFISKEAQIRETETFSDSSP IEIIDEFPTLISSKTDSFSKLAREYT 875 

I I : IM:|| I : : : : I I : III 

Db 2010 ITSFPESESYSYETSTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTTEKKSP 2 069 

Qy 876 DL EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNIQP 911 

II | | | : | : : : | : I I I I I : I 

Db 2070 SEARQDVDLCLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESEKPLTQSGGAPPPS 2129 

Qy 912 KVEEKISFSDDFSKNGSATSKVLLLPPDVSAXGHTQ7VEIESIVKPKVLEKEAE 964 

: : I I I I I II: III : : I I 

Db 2130 GGKQQGRQCDETPPTSVSESAPSQTDSDV PPETE ECPSITADANIDSEDE 217 9 

Qy 965 KK-LPSD TEKEDRSPS AIFSADLGKTSWDLLYWR 998 

: : I : I : I I I I I : I I I II 

Db 2180 SETIPTDKTVTYKHMDPPPAPMQDRSPSPRHPDVSMVDPDALAVDQNLGKAVKKDLKEKT 2239 

Qy 999 DIKKTG 1004 

I I I 

Db 2240 KTKKPG 2245 



Search completed: September 3, 2004, 16:09:59 
Job time : 38.1547 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



September 3, 2004, 16:09:02 ; Search time 116.281 Seconds 

(without alignments) 
3191.803 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKI PGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1298764 seqs, 315065143 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1298764 



Database : 



Published_Applications_AA: * 

/ cgn2_6/ptodata/ l/pubpaa/US07_PUBCOMB . pep : + 
/ cgn2_6/ptodata/ l/pubpaa/PCT_NEW_PUB . pep : * 
/ cgn2_6/ptodata/ 1/pubpaa/US 0 6_NEW_PUB . pep : * 
/ cgn2_6/ptodata/ 1/pubpaa/US 0 6_PUBCOMB . pep : * 
/cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep: * 
/cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep: 1 
/ cgn2_6/ptodata/l/pubpaa/US08__NEW_PUB.pep: * 
/ cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 
/ cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep: * 
/cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep: 
/cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep: 
/ cgn2_6/ptodata/ 1/pubpaa/US 09_NEW_PUB. pep: * 
/ cgn2_6/ptodata/ 1/pubpaa/US 10A_PUBCOMB. pep: 
/ cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep: 
/ cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep: 
/ cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep: * 
/ cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep: * 
/ cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep: * 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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17 


1487.5 


25 


. 1 


373 


16 


US-10-466-258-4 
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23 


.9 
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19 
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.7 
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14 
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.0 
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US-09-893-348-20 
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.7 
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9 


US-09-893-348-25 
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931 
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.7 
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.6 
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12 
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12 
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25 
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9 
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26 
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16 
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4 
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14 
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12 
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29 
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3 
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12 
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30 
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13. 
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12 
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31 
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9 
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9 
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33 
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6 
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34 
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3 
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12 


US-10-660-946-8 


35 
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11. 


3 
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14 
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36 


630 
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6 
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6 
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10. 


6 
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1 
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10 
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8 
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8. 


7 


241 


12 
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ALIGNMENTS 



RESULT 1 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1 
; GENERAL INFORMATION: 

; APPLICANT: Strittmatter , Stephen M. 



; TITLE OF INVENTION: No. US2002 0012965Alo Receptor-Mediated Blockade of 
Axonal Growth 

FILE REFERENCE: 4 457 4-5 073-US 
CURRENT APPLICATION NUMBER: US/ 09/758 , 14 0 
CURRENT FILING DATE: 2001-01-12 
PRIOR APPLICATION NUMBER: US 60/175,707 
PRIOR FILING DATE: 2000-01-12 
PRIOR APPLICATION NUMBER: US 60/207,366 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: US 60/236,378 
PRIOR FILING DATE: 2000-09-29 
NUMBER OF SEQ ID NOS : 2 0 
SOFTWARE : Patentln Ver. 2.1 
SEQ ID NO 6 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-758-140-6 

Query Match 98.2%; Score 5815; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 5.8e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQEVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I N I I M I M I II I I I I I I I I I I I | | | | | | I | | | | | | M I I II I 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I I N II I I I I I I I I I M I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

I M I I I I I I I I I I I I I I I M I I I I I I I I I I II I | | | | M I I I I I I I I I I I I I I I I I || I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 18 1 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

I I I : I : = I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I M I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I I M I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | M I I I I I I M I I I I I I I I I I I I I 
Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAV^PMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESPCV 406 

M II I I I I I I I I I I I I I I I I I I II I II I I II | | M || | | | | | | | | | | | | | | | | | | | | | | | 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I M II II I I I II I I I I | | | | | | | | | | | | | || | | M | | | | | | || | | || | | | | | | | | | | 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 80 



Qy 



4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 
HI I I I I I I I I I I I II I I I I I I I I | I I I I I II I I I I I I I I I I M | | | | | | | || | I I I 



Db 


481 


PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 
1 1 1 1 l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 j t i i [ i i i i j i i i i i i i i i i i i i i i t i i i i , i i i 

EWANMPEGLTPDLVQE^CESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


586 


Db 


541 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 

1 1 I 1 1 1 | j j 1 1 t | 1 i i i i t i i i i i t i i i i i [ i i i i i i i i i i i i i i i i i i > . i i i 

M 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 I I 1 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 


Db 


601 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 

1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 t 1 1 I I i i i i i i i i i 

11,1111 11 1 1 1 1 1 ' 1 I 1 1 I 1 1 1 1 1 1 If 1 1 1 M 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 

EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


705 


Db 


661 


720 


Qy 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 

1 1 1 1 1 | 1 1 | 1 | | | | j i j i i i i i i i i i i i i i i i i i i i i i i i i i i i i i . , . . . , 

1 1 1 1 1 1 ' M 1 ' M 1 1 M 1 If 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 | | | 1 | | | M | | | | | | | | | | | | | | 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 


765 


Db 


721 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 | 1 1 ! I 1 | 1 1 1 1 1 | L | I | I 1 1 1 1 | , . 

1 ' ' 1 ' 1 1 1 1 ' 1 1 1 1 I' 1 1 1 1 1 1 M If 1 1 f 1 f 1 | | | | | | | || | | | | | | || | | | | | | | | | | | 

EYENKEKLSALPPEGGKPYLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


781 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 
1 I 1 ! I 1 1 1 1 1 1 1 I I 1 1 I 1 l l i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ..... . 

1 1 11 1 1 I' M 1 f 1 1 f 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I I | | | | | | || | | | | | | | | | | | | | | | | | 

SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 

1 1 1 1 1 | | i | i i i i t t j i i i i i i i i i i i i i i t t i i i i i i i i i i i i i t [ 

1 11 1 1 M 1 1 1 1 1 1 1 f 1 1 1 1 1 1 If 1 1 f f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | 1 1 1 1 1 1 1 1 1 1 1 1 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


945 


Db 


901 


959 


Qy 


946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 

1 1 1 J 1 1 | | 1 1 1 1 1 1 1 1 j 1 1 1 I I I i i i i i i i i i i i i i i | . | i i i i i i i 

1 1 ' 1 1 ■ I > ■ I i ■ ■ 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I I I f | | : | 1 1 1 1 I I I | | | | | | | | If | 

TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGV 


1005 


Db 


960 


1019 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 

1 1 1 j 1 1 | j 1 1 | i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i * i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 If 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


1079 


Qy 


1066 


EVAISEELVQKYSNSAiGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 

I'll 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I I I II 1 1 1 1 1 1 1 1 1 1 1 II 

EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


10 80 


1139 


Qy 


1126 


LLI LALI SLFS VPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 117 8 

> 1 1 M 1 1 1 1 1 1 1 1 M | | | | | | | | | | | | | | | | | | 1 1 1 1 I I I I | | | | | 1 1 1 1 1 1 1 

LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 




Db 


1140 





RESULT 2 

US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER, STEPHEN M. 

; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH 
; FILE REFERENCE: C077 CIP US 

; CURRENT APPLICATION NUMBER: US/09/ 972 , 599A 
; CURRENT FILING DATE: 2001-10-06 



; PRIOR APPLICATION NUMBER: PCT/US01/ 01 04 1 

; PRIOR FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: 09/758,140 

/ PRIOR FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: 60/236,378 

PRIOR FILING DATE: 2000-09-29 
; PRIOR APPLICATION NUMBER: 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 
; SOFTWARE: PatentlnVer. 2.1 
; SEQ ID NO 6 

LENGTH: 1192 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-972-599A-6 



Query Match 98.2%; Score 5815; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 5.8e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 



Qy 


1 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 
1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 I I | | | | | | | | | | | | | | | | | || | | | | | | | | | | | | || | | | | | , | | | 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Db 


1 


60 


Qy 


61 


AG L S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP AP 

1 1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 II 1 1 1 1 1 1 1 1 | | | | || | | | | | | | | | | | | || | | 

AG L S AAP VP TAP AAGAP LMD F GN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP AP 


120 


Db 


61 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 
H M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I I I I I | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 II 1 1 1 II 1 II 1 1 1 
SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Db 


121 


180 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 
' 1 1 : 1 : : 1 1 M 1 1 1 1 1 II 1 1 II II 1 1 II 1 1 1 I I I | | M M 
SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


226 


Db 


181 


240 


Qy 


227 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 I M 1 1 1 1 1 1 1 1 1 II 
LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 


Db 


241 


300 


Qy 


287 


GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 
GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 


346 


Db 


301 


360 


Qy 


347 


SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 
N 1 1 1 1 II 1 1 1 1 M 1 II 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 | | | | | | | | M 1 1 1 1 
S SEKAKDS FNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS DMLAAGGKI ESNLES KV 


406 


Db 


361 


420 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I | || | | | | || | | | | | 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 


466 


Db 


421 


480 


Qy 


467 


PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 
HI 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | | || | | | | | | | | 
PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


526 


Db 


481 


540 



Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | M I I I I I I I I I | I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYP7VAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I | I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 64 7 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I M I I I I I I I I I I I M I I II I I I I I I I I I II I I I I I I I I I || | | | | | | | | | | 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

QY 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I M I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I || | | || | | | | | | | 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

QY 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I M I I I I M I I I I I I I I II I I I I I I I I I || I I I | | | | || | | | | | | | | || M | || | 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

QY 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I M I I I II II I I I I I I I I I I I I I I || | | | | | | | || | || | | || | | | | || | | | 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I M I II M II M I II I I I I I I I I I II I I I I I I I I I I II I I I I M || | | | | | 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

QY 946 TQAEIES IVKPKVLEKEAEKKLPSDTEKEDRS PSAI FSADLGKTSWDLLYWRDI KKTGV 1005 

I I I I II II II II I II I I I I II I I I I M | | || | : | | | | | | | || | | || | | | || | 

Db 960 TQAEIES IVKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTSWDLLYWRDI KKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I M I I I M I I II M I II I I I I I I I I I I I I 1 I I I I || | | | || || | || | | || | || | | | | | | 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I I I N I I I I I I M I I I I I M I M I I I II I I I I I I I I I I || | | | | | || I I II I I 

Db 1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I M I I I I I I II II I I I I I I I I I I I || I I I || I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 3 

US-10-060-036-71 

Sequence 71, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Benson, Darin R. 
Kalos, Michael D. 
Lodes, Michael J. 
Persing, David H. 
Hepler, William T. 
Jiang, Yuqiu 



; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
; TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
; FILE REFERENCE: 210121.566 

; CURRENT APPLICATION NUMBER: US/ 1 0/ 060 , 036 
; CURRENT FILING DATE: 2002-01-30 
; NUMBER OF SEQ ID NOS : 4560 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 71 

LENGTH: 1192 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-060-036-71 



Query Match 98.2%; Score 5815; DB 14; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 5.8e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 
M M I I I I I I I I I I I I I I I I I I 1 | | | | | | || | | | | | | | || || | 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

AGL S AAP VPT APAAGAP LMD FGN D FVP PAP RG P L PAAP P VAP ERQ P S WD P S P VS S TVP AP 120 

I I I I I I I I I I I I I I II I I I I I I | || | | | | | | | | | | | | | | | | | || | | | | | | | | | | 

AGL S AAP VP T APAAGAP LMD FGN D FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 120 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 
I I I I N I II I I I I I I I I I I I || | | | | | | | | | M | | | | | | | | | | || | | | | | | | | | || | | | | 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

1 1 I : | : : I I I I I I I I II I I I I M I I I I I I I I I I II I I I I I 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 2 86 

N I I I I I I II I M I I I I I I I I II I I I I || | | | | | | | | | | | || | | | || | M || | | 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 
I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I 1 I II I I I I II || I I I | | | | | | I M | 
SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 
I I I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I II I I I I | | | | | || | | I I | | 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 80 

PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 
HI I I I I I I I I I I I I I II I I II I I I I I I I I I I II I I I I I I I I | | | | || | | | | | | I | | 
PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 
I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


227 


Db 


241 


Qy 


287 


Db 


301 


Qy 


347 


Db 


361 


Qy 


407 


Db 


421 


Qy 


467 


Db 


481 


Qy 


527 


Db 


541 



Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

N I II M I I I II I I I I I I I I I I I II I I I I I I I I | | | | | | | | | | | | M I II I I I I I I I I I I 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

QY 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I I I I I I I I I I I | || | | | | || I | | | | | | | | | | | | | | || | | | || 

Db 661 EAMSVSLKKVSGI KEEI KEPENINAALQETEAPYI S IACDLI KETKLSAEPAPDFSDYS E 72 0 

706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

N I I I I I I I I I I I I I I I I I I I M I M I M M I I I I I I I I I I I I I I I II I I II I I I I I I I I 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 



766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I f I I I I I I | | I I || I | | | | | || | | | | | | | | | | | || | | | | 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 84 0 



Db 781 



^ 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I I I I I I I I I I I I I I I | | || | | | | | | | | | | || | | | | | | 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I N I I I I I I I M I I I II II I I II II I I I I I | | | | | | | || | | | | | | | | | | | | || | | | 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I I I I I M I I I I II I II : I I I II I I I I M || I | | | || 

Db 960 TQAEI ES I VKPKVLVKE7VEKKLPSDTEKEDRS PSAI FSAELS KTSVVDLL YWRDI KKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I M I I I I I I I I I I M I I I I M M M 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

N M I I I I I II I I I I I I I I I I I I I I I M I I I I II || I I I I M | | | | | | | | | | M I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1139 

Qy 1126 LLI LALI SLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

N I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I | I I I I 
Db 1140 LLI LALI SLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1192 



RESULT 4 
US-09-789-386-2 

; Sequence 2, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,38 6 

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 



PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 1192 

TYPE: PRT 
; ORGANISM: HOMO SAPIENS 
US-09-789-386-2 



Query Match 98.1%; Score 5810; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 

Q y 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I N I I II I I I I I I I I I I I I | | I | | | | | | | | | | | | | | | | | ! | I I | | | | | | | M | I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Q y 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 AGL S AAP VP T AP AAGAP LMD FGND FVP PAP RGP L PAAP P VAPE RQ P S WD P S P VS S T VPAP 120 

QY 121 S P L S AAAVS P S KL P EDD E P P ARP P P P P PAS VS PQAE P VWT P P APAPAAP P S T PAAP KRRG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I || | || | | | | | M II I I I | 

Db 121 SPLS AAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



Q Y 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

1 1 1 : ' : = I M I M I I I I I I I I I II I I II I I II I || | | | M 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 24 0 



Db 181 



Q y 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I | | | I I I I I I I I I | || | | | 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSE1ASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Q y 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 

I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I | | | | | | | | | | | M | | | | | | 

Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 406 
I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | || | | | | | | | || | | | | | | | 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

^ 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIF 4 66 

I I I I I I I I I I I I I I I II I I | | | | | | M I I I I I I I I I I I I I I I I I I I I I | | I I | | 

Db 421 DKKC FADS LEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAP FNPAATESIATNIF 480 



^ 467 PLLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYWTDNLTKVTE 526 

111 MINI I I I I | | | | | | | || | | | | | | || | | | | | | | | | | | | || || | || | | | | | | | | 

PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYWTDNLTKVTE 540 



Db 481 



Qy 527 EVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

IIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIII 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I I I I I I I I I I I I I I I | | | | I I I I I II II I I I | | | | | | | | | | | | | | | | | | | | | | | 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 



Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 7 05 

_ I I I I I I I I I M I I I I I I I I II I | I || M I I I I I I I I M I M I II I I II I I I I I I I I I I | 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 72 0 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

M I I I I I I II I I I II I I || | | || | | || I I I I I I I I I I I | | | | | | | | | | | | | | | | | |,| | | 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Q ¥ 766 EYENKEKLS7VLPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 
1 I I I I I I I I I | | | | || | || | | | | | | | I I I I I I M I I I I I I M IN 

Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

1 1 1 N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I I | | | | | 
Db 841 ^NDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVS7VLGH 94 5 

N I I I I II I I I I I I I I I I I I I I M I I I I I I I I M I I I M I I I I I M I I I I I I I I M I I 

Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVT)LLYWRDIKKTGV 1005 

I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | || | || : | I I I | | | | M I I I I I I M I 
Db 960 TQAEIESIVKPKVLVKE7VEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I M I I I I I I I I I I I I I I I I | | | || | | || | | | | | | | | | || | | | | | || | | , | | | 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLES 1079 

Q y 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 
1 1 I I I I I I I I I II II I I | | | || | | | || | | | || | | | || | | | | | | | | | | | | || | | | | 

Db 1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

QY H26 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

' I I I M | | M | | | | M I I I I I I I I I I I | | | | | | | | | | | | | | | 

Db 114 0 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 5 

US-09-893-348-23 

Sequence 23, Application US/09893348 
Patent No. US200200724 93A1 
GENERAL INFORMATION: 
APPLICANT: EI SEN BACH- SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ-2A 

CURRENT APPLICATION NUMBER: US/09/893,34 8 

CURRENT FILING DATE: 2001-06-28 

PRIOR APPLICATION NUMBER: US 09/314,161 

PRIOR FILING DATE: 1999-05-19 

PRIOR APPLICATION NUMBER: US 09/218,277 

PRIOR FILING DATE: 1998-12-22 

PRIOR APPLICATION NUMBER: PCT/US98/14 715 



; PRIOR FILING DATE: 1998-07-21 

; PRIOR APPLICATION NUMBER: IL 124500 

; PRIOR FILING DATE: 1998-05-19 

NUMBER OF SEQ ID NOS : 2 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 23 

LENGTH: 1192 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-893-348-23 



Query Match 98.1%; Score 5810; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I | | | M I I I I I I II I I I II I I | | | | 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 
I I I I I I I I I I I I I I I | | | I I I I I II I I I | | | || | | | | | | | | | | | | | | || | | || | | 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 



yy 


i 


UD 


1 




C 1 

bl 


DD 


bl 


wy 






i oi 


wy 


1 ft 1 
J. O 1 


LJLJ 


1 p 1 
lol 


Ov 


f 


Db 


Z 1 x 


Qy 


287 


Db 


301 


Qy 


347 


Db 


361 


Qy 


407 


Db 


421 


Qy 


467 


Db 


481 


Qy 


527 


Db 


541 


Qy 


587 



180 



SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 
1 1 I I I I I I I I I I I I I I I I I I I I I I I | | | M | | | | | | | | | | | | | | | | | | || | | | , | , | ! 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

1 1 1 : I : = I I I I I I I I I I I I I I I I II I I I II | || | | | || | | 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 24 0 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 
M 1 I I I I I I I I II M I I I I I I M I M II I I I I I I I I | I | | | | I I I I I II I II I I I I I I I I 
LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 

M I M ' I I I I I , I M | | | | | || | | | | | | I I I II I I I I I I I I I I I I I I I 

GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 406 

HI I I M I I I I I I I I I || | | | M I I I I I I I II I I I | | | | | | | | || | | 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 



PKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I | || | | | | M | || I M I M I I I I I I I I I I I I I I I I 



I I I I I I I I II I I II I II | | | | || | | | | | | | | | I I I I | | | | 



1 1 1 1 11 > I I I I I I I I M I I I I I I I M M I I I I M I I I I II I I I I M I I I M I II I M I I I 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 
I I I II I II I I I I I I | | I II I I II II I I II I I I | || | || | | || I I I II I I I I I I I 



Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

r „ I I I I I I I I I I II I | | | | | | | | | | | | | | | | | | | | | | | | | | | || | , | | | , | | || M | , , , , 

Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

_ 1 1 I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | I I I I II I I I I I II II I I 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 7 66 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

1 1 1 1 1 1 I I I I I I I I I I I I | | | I I I I I I I I I I I | | | | | | | | | | | | | | | | ! | | | | | | | | | ( | 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 



Db 781 



Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 8 85 

OA , I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | I I M M I I I I I I I | | | 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

_ I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | || | | | | | | | | | | | | | 

Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEI ES I VKPKVLEKEAEKKLPSDTEKEDRS PSAI FSADLGKTS WDLLYWRDI KKTGV 1005 

I I I I I I I I I I I I I I II I II III I III III MINIM I Ml MM III I IN I 

Db 960 TQAEIESIVKPKVIiVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I M M I II II II I II I I I II I II I M II I II II I | | || | || | | 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

1 I I M I I I I I I I II II I I I I I I II M I I M I II I II I I I I I I I M I M I M I II I I I I II 

Db 1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

n _ I I II M I II I I I II I II M I II I I || II I I I II II II I 

Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 6 

US-10-267-502-429 

; Sequence 429, Application US/10267502 

; Publication No. US20040071700A1 

; GENERAL INFORMATION: 

; APPLICANT: Kim, Jaeseob 

; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 
; FILE REFERENCE: LSD-07416 

; CURRENT APPLICATION NUMBER: US/10/267,502 

; CURRENT FILING DATE: 2003-01-27 

; NUMBER OF SEQ ID NOS : 439 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 429 

LENGTH: 1192 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-267-502-429 



T Mat ^ h 98 Sc °" 5810; DB 12; Length 1192; 

Local Similarity 97.2%; Pred. No. le-283; 
hes 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 



1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



1 

61 
61 
121 



11 1 < I I I I M I I M I I I I I I I I I I I I | | M I I I I I I I | | | | | | | | | M I I I I I I I I M I I 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 



60 



AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 
1 1 1 1 1 1 1 11 I I M I M I M I I I I I M I I I I I I I I I M | | | | I M I I I I I M I I I I I I I I I 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 



SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

10 llllllllllllliiiiiMiiMiii Illlllllllllllllliiiniiiiii 

121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 



180 



181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

' LJL^, m ^ _i.JJ J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SP 240 



181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 



227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

0A , Jii 1111111 111 HI IMMIIIIIIIIIIIllliiiiiiiM 

241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 
I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 
I I M I I M I II II I I I I I I I I I II | | | | | | | | | | || | | | || | | | | | | | | | | | , , 

361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 



407 



466 



DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 

A „ I ' I I I I I I I I I I I I I I I I I I II I I I I I I II I I I | I I I I I I I I I I I II I 

421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 



' I I I I I I II I I I I I I I I II I | | | M I I I I I I I I I I I I I I I I 



I I I I 



481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYWTDNLTKWE 540 

527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

CA 1 1 1 11 1 1 I I I I I I I I I I I II I I I I I I | | | | | | II I I I I I I I I I I I I I I I I I I I 

541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 
fn , N I I M I I I I I I I I I I I I I | || | | | | | | | | | | | | || | | | | || | | | | , | | | , 

601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYE 660 

647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

£C1 1 1 1 I I I I I I I I || | | || | || || | || || | I I I I I I I I I I I I I I I I I I I I I I I I I 

661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I M I I M I I I I I I II I I I I I I I I I I II I I I || | | | || | | || | | | | | | | | | || | || 
721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 



Qy 


766 


EYENKEKLSALPPEGGKP YLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 

1 1 1 1 1 1 1 1 1 1 I | | | M | | | | | 1 I | 1 1 | 1 1 1 1 1 1 If 1 II |i 1 j 1 1 | | 1 | 1 M 1 | 1 j 

' 1 1 1 1 1 1 1 1 1 1 i i i i l l l l l l l l 1 1 [ 1 | | | | J | | 1 | ( | | | | | 

EYENKEKLSALPPEGGKP YLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


781 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 

! 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 I I I l I I i i I m i i i ii i ii i i i i i i i i i i i i 
1 1 ■ 1 1 1 1 1 1 ■ 1 1 ■ 1 1 ■ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i | 1 | 

SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


900 


Qy 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 

1 1 1 1 1 1 1 1 I I | | | | | 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l I I i i i i i i i 

1 1 1 1 1 1 1 1 1 i I i i i i 1 I I I I 1 1 1 1 1 1 1 1 1 I | | | | | | 1 | | | | | | 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


945 


Db 


901 


959 


Qy 


946 


TQAEIES I VKPKVLEKEAEKKLPSDTEKEDRS PSAI FSADLGKTS WDLLYWRDI KKTGV 
1 1 1 1 1 1 1 1 I 1 1 I | | | | 1 1 1 1 1 1 1 M 1 1 1 1 | | | | ii | ii . 1 l | 1 II 1 I I I i i 

11 1 ' 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 | | | 1 | | » 1 1 1 1 | | | 1 | | | [ | | J | | | | 

TQAEI ESI VKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTSVVDLLYWRDI KKTGV 


1005 


Db 


960 


1019 


Qy 


1006 


VFGAS LFLLLS LTVFS I VS VTAYIALALLS VT I S FRI YKGVI QAI QKS DEGHP FRAYLE S 

1 1 1 1 I I I | | | | | | | | 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 | | | | | | | | 1 1 | | 1 | 1 , | . . , , 

1 1 1 1 1 1 1 1 1 1 1 1 1 i i i i l i i 1 1 1 1 1 1 1 1 1 1 1 I | | | | | 1 | | | | | | | | | | | 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVI QAIQKSDEGHPFRAYLES 


1065 


Db 


1020 


1079 


Qy 


1066 


EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 

1 1 | 1 1 I 1 1 1 1 1 1 1 1 1 1 I 1 l l r i i i i . , 

1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 | | | | | | | | | M 1 1 1 II 1 1 1 1 1 1 1 1 M II 1 1 1 M 1 1 

EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 


Db 


1080 


1139 


Qy 


1126 


LL I LALISLFSVP VI YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I || | | | | | | | | | | | | | | | || || M | | || | 

LLILALISLFSVPVIYERHQAQIDHYLGIANKWKDAMAKIQAKIPGLKRKAE 1192 




Db 


1140 





RESULT 7 
US-10-327-213-9 

; Sequence 9, Application US/10327213 

; Publication No. US2004 0121341A1 

; GENERAL INFORMATION: 

; APPLICANT: FILBIN, MARIE T. 

; APPLICANT: DOMENICONI, MARCO 

; APPLICANT: CAO, ZIXUAN 

; TITLE OF INVENTION: INHIBITORS OF MYELIN-ASSOCIATED GLYCOPROTEIN (MAG) 

; TITLE OF INVENTION: ACTIVITY FOR REGULATING NEURAL GROWTH AND REGENERATION 

; FILE REFERENCE: CUNY/ 003 **±±vn 

; CURRENT APPLICATION NUMBER: US/ 10/327 , 2 13 

; CURRENT FILING DATE: 2002-12-20 

; NUMBER OF SEQ ID NOS : 43 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 9 

LENGTH: 1192 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-327-213-9 



Query Match 98.1%; Score 5810; DB 16; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

QY 1 MFDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I I I I I I I > I I I M I I I I I I M I I I I M I I I M M I I I I I I I I I I M I I I M I I M I I I I I 

61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 
121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

101 IINIIIIMMIIIIIMMlllliiiiiiMIIIII Illllllllll 

121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

\Hl r I; : I I I I I I I I I I I I I I I II I I I I I I I I I I I | | | | | 

PSLSP 240 



il SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLP5 



227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I Mil 

241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 

_ 1 1 1 1 1 I M I I I I I I I I I I I I | | | | | I I I I I I I I I I I I I I I I I I I I I I I I | | | M 

301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 



347 



406 



SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 

I I I I I I I I I I I I I | | | | | | | | I I I I I I I I I I I II I II I II I I I II I I I M I I II I 

361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 



407 



DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | || | | | | | | || | | | | | | | | 

421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 



467 



PLLEDPTSENXTDEKKIEEKICAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

111 MINI I II II I I I I I I I I I I | | | | | | || | || | || | || | | | | | | | || || , | | | | 

481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I I I I I I I I I II I M I I II I HI I I I I I I I I I I I I | I I I I I I I I I I I | M II I M I I 
541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

_ I I N I I I I I I I I I I I I II I I II I I I I I I I || | || | | M | || | | | | | | | 

601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 
1 1 < I M I I I I I I I II I I II I I I I | | | | | | | | || | | | | | | | || | | | || || | | | | | 

661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 
706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

1 1 I I I I I I I I I I I I I M M I M I I I II I I I I I M I I I I M I | I I I I I I I I I I I I I I I I I I 

721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 



766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I N I I I I I I I I I I I I I I I | M II II I I I I I I M I I II M I I I I M I I I I I I I I I I I 
7 81 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 
I I I I I I I I I I I I I I I I I | || | | | | || | | | | || || | |, | | | | | M | M | | 

841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 



Qy 


886 


Db 


901 


Qy 


946 


Db 


960 


Qy 


1006 


Db 


1020 


Qy 


1066 


Db 


1080 


Qy 


1126 


Db 


1140 



ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 
11 1 1 1 1 I I I I I I I I I I I I I I I II I I | M I I I I I I I I I | | | | | | M | | | | | | | | | | ! | , 



959 



IN II Mil III II I I I I I I I I I I I I I I I | | | | | t I I I : I || | | | | || | | | | | | | | | 



^ » -^j-yauuijonr £ ru-vx JjCjO lUDJ 

1 1 1 1 1 I M I M I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | , | , | | | | | | M M , 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

1111 I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I | | | | | | | | | || | | M | 

EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

LL I LALI S LFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI P GLKRKAE 1178 
I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | Mm 



RESULT 8 
US-10-466-258-9 

; Sequence 9, Application US/10466258 

; Publication No. US2004 0132096A1 

; GENERAL INFORMATION: 

; APPLICANT: GLAXO GROUP LIMITED 

; TITLE OF INVENTION: ASSAY 

; FILE REFERENCE: P80966 GCW 

; CURRENT APPLICATION NUMBER: US/ 10/4 66, 258 
; CURRENT FILING DATE: 2003-07-15 
; NUMBER OF SEQ ID NOS: 13 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 9 
; LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-466-258-9 



Query Match 98.1%; Score 5810; DB 16; Length 1192; 

Best Local Similarity 97.2%; Pred. No. le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

1 1 1 1 < I I M I I I I I II I I I I I II I I I I I M I I I M I I I M I M I || | I || | | | || 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

AGL S AAP VP T AP AAGAP LMD FGND FVP PAP RG PL PAAP P VAP ERQ P S WD P S P VS S T VPAP 12 0 
''''''' I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
AGL S AAP VP TAP AAGAP LMD FGN D FVP PAP RGP L PAAP P VAP E RQ P S WD P S P VS S T VP AP 120 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

' ' F ' 1 1 ' ' ' r ' ' I f I I I I I I I I I I I I I 1 I I t i I I I I I r I I f 1 I I I F 1 I I I ! I f I f I I I I I 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

SSGA WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

1 1 1 : I : ' I I I I N I I I I I I I I I I I I I I I I I I I | | I | M I I 



Qy 


l 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 



181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 
N M I I I I I I I I I I | | | | M I I I I I I | | | | | | | | | | | | | | | | || | | | | , | | | , | | | | M | 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 



241 



287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQEL 



301 



»«— PTALTKLVKEDEW 346 

N I M I I I I I I I I M | | | | | | | | | | | ! | | ! , | , , , , m I I I I I I I I I I I I I | | | 

GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 



34 7 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 
ilL'J'ili 1 1 1 1 1 11 1 1 1 1 I I I I I I I I I I I I I I I I | | | | | | | | | | | | | I I I I I I | | | | | 

ESNLESKV 420 



361 S S E KAKD S FNE K RVAVEAPMRE E YAD FK P FE RVWE VKD S KE D S DMLAAGGK I 



407 
421 
467 
481 
527 



DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 

I I I I I I I I I I I M I I I I I I I I I I | | | | | | | | M I I I II I I I | | | | | | | | | | | I 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 4 80 

PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 
111 M >IM I I I I I I I I I I I I I I I I I | | | | | M | || | | || | | | | || | | | | 

PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 



EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

^1' 1 1 11 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I II I I I I | 

541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC- " " 



PSF 600 



587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

_ I I IN II I I I I I I I I I I I I I I I I I II I I I I I I I I I | | | I I I I I I I I I I I I I I I I 

601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

CC1 1 I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I | | | | | 

661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

^ 1 1 N I II I I I I I | I I II I I I I I I I | | | | | I I I I I I I I I I I | | 

721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 



766 
781 



EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M | I I I I I I I I I I I | I I I I 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

0/11 I I I I I I I I I I I I I I I I I I I I I I II I II I | | | | | | | | | | | | | | | | | | || , | | | | | 

841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

„ I I I I I I M I M I I I I I I | | | | | | | | I I I I I I I | | | II I I I I I I I I I I I I I I 

901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

Qcn Him Ml INI I I Mill II III I MINI I: i MINI MM MINIM 

960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 



1006 
1020 



VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I Ml I Ml I M I M I II I II II I I I II I M II II M II || | || | || || I 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 



Qy 1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 



Db 



, Aon 1 11 1 n 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1, 1 1 1, |, iY M y 1 7; 1125 

1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 



Qy 1126 LLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

11/n '' IIIMM IIIIIIIIIIIIIIIIMI|||||||||||IMI|||||||||| 

Db 1140 LLI LAL I SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1192 

RESULT 9 
US-10-408-967-7 

Sequence 7, Application US/10408967 
Publication No. US2004 0063161A1 
GENERAL INFORMATION: 
APPLICANT: Pharmacia & Upjohn Company 
APPLICANT: Yan, Riqiang 
APPLICANT: Lu, Yifeng 

^ 2 r p ° 5itions and Methods ° f T " ati ° 9 M!iei -"' ! 

CURRENT APPLICATION NUMBER: US/10/408,967 
CURRENT FILING DATE: 2003-04-08 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 7 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-408-967-7 

Query Match 97.8%; Score 5794; DB 12; Length 1192; 

Best Local Similarity 97.0%; Pred. No. 6.6e-283- 

Matches 1157; Conservative 4; Mismatches 16; Indels 16; Gaps 3; 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 

, 11111 Nil II I || | | | | | | | | | | || | || | | | | n | | | | | , | , | , , M 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



60 



QY 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

C1 I I' I I III I Ml Ml IMIMIIIIII Ml Ml IMMIMII MINIUM 

61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 



Db 



Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

' ' ' iiiiJiiiiiliiiliiiliiilllJ 11 11 M I I II I I II II | I I I II I I 

STPAAPKRRG 18 0 



Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPP 



Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

III, ,: : M I I II I II I I I II II I II I I II I I I I I Mill 



Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAAS 



LPSLSP 240 



Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

0J11 I M I M M M M II I II II I II I II I I I II I I M I I II I I I II I I I I I 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 



U' 1 M M I I M I I II I I || II I II I II I I II | II II I I I I I I I M II I II I M I I I I 

iVKEDEW 360 



Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKL' 



347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

, ei 1 1 1 I I I I M I M I I I I I I I I I I I I I I | | | | | | | | | I I I I I | | | M I I I I I I I I I I | | | M 

361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 



421 
4 67 
481 
527 
541 
587 
601 



1 1 1 1 1 I I I M I I I I I I I I I I I I I I I I M I I I I II I I I I II | | | | | | | | | | | || | | | | | | 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 
111 I I I I I I I I I I I I II || | | || | | | | || | | || | || I | II II I I I I I I I I I | | | | | | 

PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 
1 1 11 I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I | M | I I I I I I I I | | | | | M 
EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 
1 1 1 1 11 I I I II I I I I I I I I I I | | | | | | I I I I I I I II I I I I I I I I | | M | | | | | | | 

EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 



647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 
1 1 1 11 1 I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | 
EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 



661 
706 
721 
766 
781 
826 
841 
886 



MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 
1 11 1 11 1 < > I I I I I II II I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I II I II 
MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 



646 
660 
705 
720 

765 
780 



EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 
1111 I I I I I I I I I I I I I I I II I I I I I I I I | || | | | | | | | | | | | | | | | | 

EYENKEKLSALPPEGGKPYLESFKLSLINTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 



840 
885 



SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 

1 1 1 1 1 1 11 > M I I I I I I I I I I I I I I | | | | | | | | | | | M I I I I I I II I I I I I I I I I 

SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 



ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 94 5 
Qni 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | I I I I I 
901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 



946 
960 
1006 
1020 



TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKT3WDLLYWRDIKKTGV 1005 
I I I I I I I I I I I I I I III || | | | | | | | | | || || | : | | | I I I I I M I I | | 

TQAEIESIVKPICVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 



1019 



VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

1 1 1 1 1 1 1 1 1 I I M I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I | | | | M I I I I I I I 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFPAYLES 1079 



1125 



1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 
inft 1 1 1 1 1 1 1 1 I I I I I I I I | | | | | | | | I I I I I I I I I I | | I I I I II I I I I I I | | | | | | | | | I I I 
1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 



1126 
1140 



LLI LALI S LFSVPVI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

N I I I I I I I I M I I I I I I I | | | | M | | M | I I I II I I I I I I I I | | | | M I I I I 
LLILALI S LFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 10 
US-09-893-348-18 

; Sequence 18, Application US/09893348 
; Patent No. US200200724 93A1 
; GENERAL INFORMATION : 

; APPLICANT: EI SENBACH- SCHWARTZ , Michal 
; APPLICANT: COHEN, Irun R. 
; APPLICANT: BESERMAN, Pierre 
; APPLICANT: MOSONEGO, Alon 
; APPLICANT: MOALEM, Gila 

^7JuL°J INVENTI0N: ACTIVATED T-CELLS, NERVOUS SYSTEM- SPECIFIC ANTIGENS AND 
lflijlR USES 

FILE REFERENCE: EIS-SCHWARTZ-2A 
CURRENT APPLICATION NUMBER: US/09/893, 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 18 
LENGTH: 1163 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 

Query Match 72.5%; Score 4296.5; DB 9; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 1.3e-207; 

Matches 885; Conservative 104; Mismatches 156,- Indels 51; Gaps 19; 

Qy 1 ^EDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 5 8 

, 111:111 11111 I I I I I I I I I I I I I I | | | | | | I I I I I : I | :: | | | M | | | M | I I 
Db 1 MICDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 PAAGL S AAP VPTAP AAGAP LMD FGN DFVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP 118 

' 1111 Ml Mhll :| I | MMI || || || | | ||; || : | 

Db 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA— P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

111 I I II I I I I | | || || | | | | | | | | | | || || I I I I I I | | I I | | | 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGS SG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

, £n 1 1 1 : M I I I I I I I I I : I : I M M I I I I I I I I I I I I I 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

' 1 1 1 ''III I I I I I I I : I I I : : I : : | | I I I : I : I : : I I I I I I M I M 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 



404 
397 



I I I I I I I I I I I I = : I I : I I : I I: : I | : | : | | : | | | | | | | | 

287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 340 

345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNLE 403 

111 11 I HI : : : I I I : I i I I I II I I I I : MIM: | | | : | | | : | : | 

341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 4 63 

IIM: II Mill : I I I I hi II : || | MINI I : I I II 

SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 455 

464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

_ 1 11 I I I I I M I I :| I I I: I I I || MMII II M I | MINI |:| 

456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 
C1C 111 l : IMI III MM II I I III III I I I I I I I II I : I I I I i I I :||||ll MM 
515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 

584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 
' 1 1 ' I : I I I I I I N II I I I I I I II I : I II I M I : II I Mill |:|:||| Mill 
PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 



575 



583 
574 



643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

coc HIHII 1 ^!! I M Mill: I II : II II II II | | || II I I I I I M I I Mill: 

635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

£nc 111:11 1= ' ' - I 1 I I I f I I = I I I I I I I I I I I I I = I I I |:| I 11:1111 II I I 

695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

7 63 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : : : I : I I I I I M I I M I I : : I : M I : : : M : I II I I I I II I : I 
754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

010 |: :|:IMIIIIIIIIIIIIIIII :|:| || I II : I I I I I II I I 

812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

883 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

1 1 I I I I I I I I I I I I I II I M II : : : I I : II : | | : || : I : I II 
871 SEIANIQSGADSLPCLELPCDLSFKNIYPK— DEVHVSDEFSENRSSVSKASISPSNVSA 928 

943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

„' I I: I I I I I I M I II II I I M: M:| MMII I 

929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

noa ' I ' I I I I I I II II I I | II | | | | | | || I II II II II II II I I II I 

988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

1063 LESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 



1048 



I I I I I I I I I I I I M I I I I I I I I I I I I II I I II I I II II I II II II I I II II II I 

LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 



1107 



1123 GLTLL I LAL I SLFS VP VI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 
1 1 1 1 I 1 1 I I I I I M : I I I I I I I I IMIIIIIIIIMIMIIIIIMMIIIIII: 



Db 



1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 



RESULT 11 
US-10-267-502-431 

Sequence 431, Application US/10267502 
Publication No. US20040071700A1 
GENERAL INFORMATION: 
APPLICANT: Kim, Jaeseob 
APPLICANT: Galant, Ron 

TITLE OF INVENTION: Obesity Linked Genes 
FILE REFERENCE: LSD-07416 

CURRENT APPLICATION NUMBER: US/10/267,502 
CURRENT FILING DATE: 2003-01-27 
NUMBER OF SEQ ID NOS : 439 
SOFTWARE: Patentln version 3.2 
SEQ ID NO 431 
LENGTH: 1163 
TYPE: PRT 

ORGANISM: Mus musculus 
US-10-267-502-431 

Query Match 72 .2%; Score 4277.5; DB 12; Length 1163; 

Best Local Similarity 73.8%; Pred. No. l.le-206; 

Matches 883; Conservative 108; Mismatches 154,' Indels 51; Gaps 19; 

QY 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

, 111:11 ' I I I I I I M I I I I I I : I I : I I M :: | | | | | | | | | | | | | | 

Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

QY 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

' 1 1 1 1 1 1 1 1 1 1 1 1 I I : I I : I | | | | | | | | | | | | | | | | | , | | | | : | | . | , 
Db 61 AAGLSAVPVP— PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS— A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

11C 11 1111 I I ' I I I I I I I I I I || M || || I,,, ||M| 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

n , 1 1 : I I I I I I I I I I I I : I : I M | | || | | | | | | | | || | 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSE 285 

Ml I I I I I I I I I I I I I I I I : : | :: | | | : | : | : | .. , . | | | | | | | | 
Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 2 83 

QY 286 MGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEV 345 

11111= Ml lll::| I :||:||::||:|: || : || || | |||:|lll I 
2 84 MGSSFNGSPKGESAMLVENTKEEVIVRSKDKED-LVCSAALHNPQESPATLTKWKEDGV 342 



Db 



Qy 34 6 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNLES 4 04 

nK ! ' ' ' 1 11 1 :: : 1 I I : I I I I | I I I I I I : I I II I : I | I : I I I : I : I I 

Db 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 398 

Qy 4 05 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 4 64 



Db 



HIIM INN :| HIM l:: |||i|| : , , ,,,, 



• 1 ' • • i i i i i i i : I I | I I I I | | : | | | | | | 

399 KVDKKCFEDSLEQKSHGKDSESRNENASFPSTPELVKDGSRAYITCDSFT-SATESTAAN 457 



Qy 525 
Db 517 



Qy 4 65 IFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

Ml:|| I INI INN |||:|| ||:||| || HUH I, ||| ||||MIIMI 
Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 
11 111 I I I I I I I I I I I I I I I I I I I I I IN I I III I : I I I I | | | : IN : | | | | | | I 

TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 576 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

HI 1:111 I M I II I III II III I :|| INI l||:|| II I I : I : I I | | | | | | 
Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 7 03 

11111111:111 I I I I I I I I : I I I I I I I I I I I I | | I I I I I I I I I I I : I ||: I 
Db 637 PYEEAMSVALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 7 63 

£M I I : I I I : I I I I I I I : I I | | : | I I : I I I I : I I I I : I I I I I I I I : 

Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 764 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

„ c ! :!:S||:| Ml III: :| III M: 11:111 I Mill :|| 
Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 883 

: 1 1 1 1 I 1 I I I ::: I : I I I M I M I II I I I I II : I : I I I I Ml I II I : I I 

Db 815 IYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKS 870 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDV3AL 943 

I I I I I I I I II : I I I | | I II III : I I : I I I : I : | | | I I I : M M 
Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL 928 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKT 1003 

: I I : : I I I M I I I II Ml I I I I I I I II II : I : MM I I I M I II II I I II I I 
Db 929 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKT 987 

Qy 1004 GW- FGAS LFLLLS LTVFS I VS VTAYIALALLSVT I S FRI YKGVI QAI QKS DEGHPFRAY 1062 

QOO 111 1 I I I I I I I I I II I I I I I I I II I I I II M M I I II II I | | | | | || I 

Db 988 GVVYFGASLFLLLSLTVFS I VSVTAYIALALLSVTIS FRI YKGVI QAI QKS DEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

inilo 1111111 1 IN H I I II Ml Ml I III III MM MM Ml II I 

Db 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 



Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

„ no ""IMIIIIIIIilllllllllllllllllllhllllllllllllllllllll 

1108 GLTLLI LALI SLFSI PVI YERHQAQI DHYLGLANKS VKDAMAKIQAKI PGLKRKAE 1163 



Db 



RESULT 12 
US-09-789-386-6 

; Sequence 6, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 



; TITLE OF INVENTION: NOVEL COMPOUNDS 
; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789, 386 

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 

; PRIOR FILING DATE: 1999-07-22 

; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 
; TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 



Query Match 25 , 2%; Score 1495 . 5; DB g . fch 3 

Best Local Similarity 31.6%; Pred. No. 2e-67; 

Matches 372; Conservative 1; Mismatches ' 0; Indels 805; Gaps 1; 



MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

AG L S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP AP 120 

I I I I I I I I I I I I I I I I I | | I I I I I I I I I II I II I II I I I I I I I | | M | M I II 

AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAP PVAPERQP SWDP S PVS STVPAP 12 0 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

111111 N I N I I M I I I I I I I I I I I I I I I I M I I I I I I || I I I I I I I I i M || i 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 24 0 

SSGSV _ n 

185 

LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

185 

IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

185 

AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

185 



185 

KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 54 0 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


186 


Qy 


301 


Db 


186 


Qy 


361 


Db 


186 


Qy 


421 


Db 


186 


Qy 


481 


Db 


186 



QY 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 
Db 186 

185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 
Db 186 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 
Db 186 

185 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 
Db 186 

185 

QY 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 
Db 186 

185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 
Db 186 



Q ¥ 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ, 



185 

AEIESIVKPKVLE 960 



Db 186 

185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

Db 1flfi M I I I M I I I I I I I I | | | | | | | | | | M | | | 

° WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

o _ ''''' ' ' I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 216 S I VS VTAYI ALALLS VT I S FRI YKGVI QAIQKS DEGHP FRAYLES EVAI S EELVQKYSN S 275 

Qy 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

„ Ml I II I III Mil I II Mill I I II III | III || | | IMM II Ml Mill I 

Db 276 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 335 

H41 YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 
I I II II I I I M II II II II I II M II I I I I I I || | || | 
Db 336 YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 373 

RESULT 13 
US-09-765-205-6 

Sequence 6, Application US/09765205 
Patent No. US2 0020034800A1 
GENERAL INFORMATION: 
APPLICANT: Cao, Li 

TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 
FILE REFERENCE: 1458.004/200130.44 9 
CURRENT APPLICATION NUMBER: US/ 09/7 65 , 2 05 
CURRENT FILING DATE: 2001-01-17 
PRIOR APPLICATION NUMBER: US/09/212 , 440 



PRIOR FILING DATE: 1998-12-16 
NUMBER OF SEQ ID NOS : 4 6 

SOFTWARE: Fast SEQ for Windows Version 3.0 
SEQ ID NO 6 
LENGTH: 373 
TYPE: PRT 
ORGANISM: human 
US-09-765-205-6 

Query Match 25.2%; Score 1495.5; DB 9; Length 373- 

Best Local Similarity 31.6%; Pred. No. 2e' 67 - 

Matches 372; Conservative 1; Mismatches ' 0; Indels 805; Gaps 1 
QY 1 MEDLDQSPLVSSSDSP^ 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEED^ 60 

QY 61 AGL S-^Py PTAPAAGAP LMD FGND FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VPAP 120 

^ NIMIMMMIIIIIIli I II! Mil || | Ill I II II MINIM 

Db 61 AGLSAAPVPTAPAAGAPLMD FGND FVP PAP RGPL PAAP P VAP ERQ PSTO l20 

QY 121 spls^vspsklpedd^ 

nK 11 11 111 11 II II I M I I Mill MM IMIIII II MM Mill MM I II I || 

Db i2i splsaaavspsklpeddepparppppppasvspqaepwtppApap^ 180 

QY 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVXLETAASXPSLSPLS^ 240 
Db 181 SSGSV 

185 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 
Db 186 



Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWS 



185 

SEKAKDSFNEKRV 360 



Db 186 

185 

Qy 361 AVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 
Db 186 

185 

QY 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 
Db 186 

185 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 
Db 186 

185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 
Db 186 

185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 
Db 186 

185 



Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 
Db 186 

185 

QY 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 
Db 186 

185 

QY 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 
Db 186 

185 

QY 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 
Db 186 

185 

QY 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 
Db 186 

185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

Db 186 — - - I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | 

VVDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 sivsvtayi^ 

Db 216 SIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVM 275 

QY 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

n , nn£ 1 1 11 1 11 1 N I I I I I I | | | I I I I I I I | | I I I I I I | | | II I I I I I I M I I I I I I | | | I M I 

Db 276 ALGHWCTIKELRRLFLVDDLVDSLKFAVI^FTWGALFNGLTLLILALISLFS^ 335 

Q Y 1141 YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

0 I I I I I I I I I M I I I I I I M I I I I I I M | I I I I I I I I I I 
Db 33 ^ YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 

RESULT 14 
US-09-893-348-24 

Sequence 24, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

THEIR^SEf INVENTI0N: ACTIVAT ^D T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/893, 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14 715 



; PRIOR FILING DATE: 1998-07-21 

; PRIOR APPLICATION NUMBER: IL 124500 

; PRIOR FILING DATE: 1998-05-19 

; NUMBER OF SEQ ID NOS : 2 9 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 24 

LENGTH: 373 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-893-348-24 

Query Match 25.2%; Score 1495.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 2e-67 • 

Matches 372; Conservative 1 ; Mismatches ' 0; Indels 805; Gaps 1 

QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

nh , ' I" ' I I I I I I I I I I I I I I I I I I I M I I I I I I I | | | | | | M I I I I I I I I I I I I I I | 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

QY 61 AGL S AAP VPTAP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 

^ N I I I I I I I I | | | | I II I I I I I I I I I I I I I | I I I I I I | I I I I I II I I I I I I I I II 

Db 61 AGL S AAP VP TAP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP ERQPSWDPS P VS S T VP AP 120 

Qy 121 SPLSAAAV SPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPST 



PAAPKRRG 180 

PSKLPEDDEPPARPPPPPPASVSPQAEPVWTP^ 180 



Db 121 spisi^si^' Hiiiililiiiiii i U- 1 1 I I I I I N I M I I I I I I I I M I I I I I I | 



QY 181 ^^ A |^ XXXKIMD LKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 
Db 181 SSGSV 

185 

Qy 241 LSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 
Db 186 

185 

Qy 301 I VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS SEKAKDS FNEKRV 360 
Db 186 

185 

Qy 361 AVEAPMREE YAD FKP FERVWEVKDS KEDS DMLAAGGKI E SNLES KVDKKC FAD S LEQTNH 42 0 
Db 18 6 



Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPT 



185 

SENXTDE 4 80 



Db 186 

185 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 
Db 18 6 



Q y 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS 



185 

FEESEATPSPVLPDI 600 



Db 186 __ 



QY 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKV, 



SGIKE 660 



Db 186 

185 

QY 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 
Db 186 

185 

QY 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 
Db 186 — - 

185 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 
Db 186 

185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 
Db 186 

185 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 
Db 186 

185 

QY 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

Db 18 , _ _ I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | 

° WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

01C ' 1 1 1 11 1 1 1 ' 1 1 I ' I ' I ' I I ' 1 f I I ( f I I f I I I I f I I J I I I I I I I I I I | | I f | | | 
Db 216 S I VS VTAYI ALALL S VT I S FRI YKGVI QAI QKS DEGHP FRAYLES EVAI S EELVQKYSNS 275 

Qy 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

0 „ 1 11 I I I I I I N I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 276 ALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI1ALISLFSVPVI 335 

Q y 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | 

Db 336 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 

RESULT 15 
US-10-408-967-8 

Sequence 8, Application US/10408967 
Publication No. US2 0040063161A1 
GENERAL INFORMATION: 
APPLICANT: Pharmacia & Upjohn Company 
APPLICANT: Yan, Riqiang 
APPLICANT: Lu, Yifeng 

lll?£lsZ^™l 2 C r° Bltl °" a "" h0da ° £ T "" i "" «^»«'° "i—. 

CURRENT APPLICATION NUMBER: US/1 0/ 4 08 , 967 
CURRENT FILING DATE: 2003-04-08 
NUMBER OF SEQ IDNOS: 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 8 
LENGTH: 373 
TYPE: PRT 

ORGANISM: Homo sapiens 



US-10-408-967-8 

Query Match 25.2%; Score 1495.5; DB 12; Lenath 373- 

Best Local Similarity 31.6%; Pred. No 2e-67- 

Matches 372; Conservative 1; Mismatches ' 0; Indels 805; Gaps 1 
QY 1 MEDLDQSPLVSSS^ 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEE 60 

QY 61 AG L S AAP VP TAPT^GAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VP AP 120 

Db 61 AGL S AAP VPTAP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP AP 120 

QY 121 SPLS^VSPSKLPEDD^ 

nK ' ' ' ' ' ' ' ' 1 1 1 1 1 1 1 1 1 11 11 1 I I I I I I I I I I I I | I I I I I I I I I | | | I I II I I I I I I I I I 

Db 121 SPLS^VSPSKLPEDDEPPARPPP^ 



Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPL 



G 180 
SAAS FKEHE YLGN 24 0 



Db 181 SSGSV . 

185 

Qy 241 LSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 
Db 186 



Qy 301 I VANPREEI I VKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWS 



185 

SEKAKDSFNEKRV 360 



Db 186 

185 

Qy 361 AVEAPMREE YADFKPFERVWEVKDS KEDS DMLAAGGKI E SNLES KVDKKC FADS LEQTNH 420 
Db 186 

185 

QY 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 
Db 186 — 



QY 4 81 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLT 



185 
PDL 540 



Db 186 

185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 
Db 186 : 

185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 
Db 18 6 

185 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 
Db 186 

185 

QY 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 
Db 186 

185 



Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 
Db 186 185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 
Db 186 ____ 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Db 186 

185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

Db 1fifi II MM III II I MINI 

10 b WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

' ' I ' N II II II II I II II | || | | | | | || | || | | | | | | || | ,| | || | , 

Db 216 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 275 

Qy 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

1 1 I II M I I II II I II I I I || I I II II II I I II I I || | | | | || || | 

Db 276 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSVPVI 335 

Qy 1141 YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

M M I M I M I I II I II I I II I I II I I I I | || | | || || 
Db 336 YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 373 



Search completed: September 3, 2004, 16:30:08 
Job time : 123.281 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



September 3, 2004, 15:59:39 ; Search time 74.0403 Seconds 

(without alignments) 
5019.975 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 



1017041 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL_25 : * 



1: 




sp archea:* 


2: 




sp bacteria:* 


3: 




sp fungi : * 


4 




sp human:* 


5 




sp invertebrate:* 


6 




sp mammal : * 


7 




sp mhc:* 


8 




sp organelle:* 


9 




sp_phage: * 


10: 


sp plant:* 


11: 


sp rodent:* 


12 


sp virus : * 


13 


sp vertebrate:* 


14 


sp unclassified:* 


15 


sp rvirus:* 


16 


: sp bacteriap:* 


17 


: sp archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


4 851 


81 , 


. 9 


986 


4 


Q8IUA4 


Q8iua4 homo sapien 


2 


42 89 


72 . 


► 4 


1162 


11 


Q8BGM9 


Q8bgm9 mus musculu 


3 


/ion c 

4277 . 5 


72 . 


, 2 


1163 


11 


Q8K3G8 


Q8k3g8 mus musculu 


4 


3658 . 5 


61 , 


. 8 


1046 


11 


Q8BGK7 


Q8bgk7 mus musculu 


5 


2580 . 5 


43 . 


. 6 


720 


11 


Q7TNB7 


Q7tnb7 mus musculu 


6 


2527 


42 . 


. 7 


639 


11 


Q8K290 


Q8k290 mus musculu 


7 


2233 


37 , 


. 7 


578 


11 


Q80W95 


Q8 0w95 mus musculu 


8 


1531 


25 . 


. 8 


392 


4 


Q96B16 


Q 9 6b 16 homo sapien 


9 


1201 . 5 


20 . 


. 3 


375 


11 


Q8BHF5 


Q8bhf5 mus musculu 


10 


1163 


19 , 


, 6 


356 


11 


Q8BH78 


Q8bh7 8 mus musculu 


11 


1152 . 5 


19 , 


, 5 


357 


11 


Q8K3G7 


Q8k3g7 mus musculu 


12 


901 


15 . 


. 2 


184 


6 


Q7YRW9 


Q7yrw9 bos taurus 


13 


887 


15 . 


, 0 


199 


13 


Q7T224 


Q7t224 gallus gall 


14 


867 


14 . 


, 6 


179 


6 


Q9GM33 


Q9gm33 macaca fasc 


15 


788.5 


13 . 


. 3 


760 


13 


Q90638 


Q90638 gallus gall 


16 


779 


13 . 


. 2 


780 


11 


Q8K4S4 


Q8k4s4 mus musculu 


17 


778 


13 . 


, 1 


780 


11 


Q8K0T0 


Q8k0t0 mus musculu 


18 


755 . 5 


12 . 


. 8 


214 


13 


Q7T222 


Q7t222 carassius a 


19 


699 


11 . 


. 8 


643 


11 


Q8CCU2 


Q8ccu2 mus musculu 


20 


688 


11 . 


, 6 


199 


4 


Q9BQ59 


Q9bq59 homo sapien 


21 


672 


11 . 


. 3 


208 


13 


Q90637 


Q90637 gallus gall 


22 


671 


11 . 


. 3 


267 


11 


Q63765 


Q63765 rattus sp. 


23 


626 


10 , 


. 6 


237 


11 


Q8C6D5 


Q8c6d5 mus musculu 


24 


623.5 


10 . 


. 5 


236 


11 


Q8VBU0 


Q8vbu0 rattus norv 


25 


586 


9 . 


, 9 


221 


13 


Q7ZUD6 


Q7zud6 brachydanio 


26 


566 


9 . 


. 6 


595 


5 


Q9VMV9 


Q9vmv9 drosophila 


27 


528 


8 . 


. 9 


234 


5 


Q9VMW3 


Q9vmw3 drosophila 


28 


528 


8 . 


, 9 


2484 


5 


Q9U347 


Q9u347 caenorhabdi 


29 


526 


8 . 


. 9 


224 


5 


Q9VMW1 


Q9vmwl drosophila 


30 


520 


8 . 


, 8 


222 


5 


Q9VMW4 


Q9vmw4 drosophila 


31 


519 . 5 


8 , 


. 8 


2607 


5 


Q23187 


Q23187 caenorhabdi 


32 


518 


8 . 


. 7 


202 


5 


Q9VMW2 


Q9vmw2 drosophila 


33 


358 . 5 


6 . 


, 1 


1417 


3 


Q871Y7 


Q871y7 neurospora 


34 


357 . 5 


6, 


, 0 


2768 


5 


Q9VC0 0 


Q9vc00 drosophila 


35 


343 


5 . 


. 8 


107 


13 


Q7T223 


Q7t223 carassius a 


36 


341.5 


5 . 


. 8 


5412 


5 


Q9W596 


Q9w596 drosophila 


37 


340.5 


5 . 


, 7 


222 


5 


Q23188 


Q23188 caenorhabdi 


38 


339 


5 . 


. 7 


1150 


5 


Q8IMM6 


Q8imm6 drosophila 


39 


334 


5 . 


. 6 


16215 


5 


Q9NFS3 


Q9nfs3 drosophila 


4 0 


334 


5 , 


, 6 


18074 


5 


j — s flTTTT A 

Q9I7U4 


Q9i7u4 drosophila 


41 


331 


5. 


.6 


7962 


4 


Q10465 


Q10465 homo sapien 


42 


331 


5. 


. 6 


34350 


4 


Q8WZ42 


Q8wz42 homo sapien 


43 


325.5 


5. 


.5 


5327 


5 


076891 


076891 drosophila 


44 


322.5 


5. 


.4 


846 


5 


Q8MRP6 


Q8mrp6 drosophila 


45 


322.5 


5, 


.4 


1109 


5 


Q9VAY4 


Q9vay4 drosophila 



ALIGNMENTS 



RESULT 1 
Q8IUA4 

ID Q8IUA4 PRELIMINARY; PRT; 986 AA. 

AC Q8IUA4; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 



DE RNT4 (RTN4) . 

GN RTN4. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Oertle T., van der Putten H. f Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T . , Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDB J databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RX MEDLINE=22376540; PubMed=124 88097 ; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL; AY102285; AAM64244.1; 

DR EMBL; AY123245; AAM64249.1; 

DR EMBL; AY123246; AAM64250.1; 

DR EMBL; AY123247; AAM64251.1; 

DR EMBL; AY123248; AAM64252.1; 

DR EMBL; AY123249; AAM64253.1; 

DR EMBL; AY123250; AAM64254.1; 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 986 AA; 108449 MW; 0CDE8F64 70364 15A CRC64 ; 



Query Match 81.9%; Score 4851; DB 4; Length 986; 

Best Local Similarity 98.9%; Pred. No. 2.4e-231; 

Matches 976; Conservative 1; Mismatches 8; Indels 2; Gaps 2 

Qy 193 MDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 252 

I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I II I 
Db 1 MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 60 

Qy 253 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 312 

I I I I I I I II 1 I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 61 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 12 0 



Qy 313 NKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS S EKAKDS FNEKRVAVEAPMREEYAD 372 

I I I I I I I I I I I I I I I I I I I II I I I I I I 1 M I I I I I II I I I I I II I I I I I I I I I I I I I I I 
Db 121 NKDEEEKLVSNNI LHNQQELPTALTKLVKEDEWS S EKAKDS FNEKRVAVEAPMREEYAD 180 



Qy 373 FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 432 

I I i II I I I I I I I I I I I I I I I I I I I I I II III I II I i I I I I I I I I I I II I I I I I I I I | I I I 
Db 181 FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 240 

Qy 433 FPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVT 492 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I II I I I I I II I I 
Db 241 FPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVT 300 

Qy 493 EKNTSTKTSNPFFVAAQDSETDYWTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV 552 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I II I I I I 
Db 301 EKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV 360 

Qy 553 TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPS 612 

1 I I I I I I I I 1 I I I M I I I I I I I I I I I I I I I I I II I I I I 1 I I I I I I I I I I I I II I M I I I I 
Db 3 61 TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPS 420 

Qy 613 AGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL-KVSGIKEEIKEPENINAA 671 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I II I II I I I I 
Db 421 AGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKKVSGIKEEIKEPENINAA 4 80 

Qy 672 LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 731 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I M I I I I I M I I I II 
Db 481 LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 540 

Qy 732 LFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLS 791 

I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 541 LFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLS 600 

Qy 792 LDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE 851 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 601 LDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE 660 

Qy 852 IIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQP 911 

I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I II 
Db 661 IIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQP 720 

Qy 912 KVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDT 971 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I 
Db 721 KVEEKISFSDDFSKNGSATSKVLLLPPDVSAIA-TQAEIESIWPKVLVKEAEKKLPSDT 779 

Qy 972 EKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI7VL 1031 

I I I I I I I I I I I I I : I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I 
Db 780 EKEDRS PS AI FSAELS KT SVVDLL YWRDI KKTGWFGAS LFLLLS LTVFS I VS VTAYI AL 839 

Qy 1032 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKE 1091 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 840 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKE 899 

Qy 1092 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHY 1151 

M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I II I I I 
Db 900 LRRLFLVDDLVT)SLKFAV1jMWFT 959 

Qy 1152 LGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 960 LGLANKNVKDAMAKIQAKIPGLKRKAE 986 



RESULT 2 
Q8BGM9 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



PRELIMINARY; 



PRT; 1162 AA. 



Created) 

Last sequence update) 
Last annotation update) 



Q8BGM9 
Q8BGM9; 

01-MAR-2003 (TrEMBLrel. 23, 
01-MAR-2003 (TrEMBLrel. 23, 
01-OCT-2003 (TrEMBLrel. 25, 
RTN4. 
RTN4. 

Mus mus cuius (Mouse) . 
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A. 
RC STRAIN=129/SvcJ7; 
RA Van der Putten H.; 

RL Submitted (MAY-2 002) to the EMBL/ GenBank/DDBJ databases. 
RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9Svc J7 ; 

RA Van der Putten H., Mir A. ; 

RL Submitted (MAY-2 002) to the EMBL/ GenBank/ DDB J databases. 
DR EMBL; AY102284; AAM73506.1; -. 
DR EMBL; AY102286; AAM73511.1; 
DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO:0005783; C : endoplasmic reticulum; IDA. 
DR GO; GO: 0007399; P : neurogenesis ; IDA. 
DR InterPro; IPR003388; Reticulon. 
DR Pfam; PF02453; Reticulon; 1. 
DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1162 AA; 126613 MW; 855697FBEE11781F CRC64; 



Query Match 72.4%; Score 4289; DB 11; Length 1162; 

Best Local Similarity 73.9%; Pred. No. 1.7e-203; 

Matches 883; Conservative 109; Mismatches 153; Indels 50; 



Gaps 



18; 



Qy 

Db 

Qy 

Db 



1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I 1 : I I I I :: I I I I I I I I I I I I I I 
1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 



59 



60 



119 



60 AAGL S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L PAAP P VAP ERQPSWDPS P VS S T VP A 

I II I I I I I I I III I I I : I I : I I I I I I I I I II I I I I I I I I I I I I : I I : I 

61 AAGLSAAPVP — PAA-APLLDFS SDS VP PAPRGPLPAAP PTAPERQP SWERS PAAS A 114 



Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

M I I II I I I I I I I I I I I I I I I I I II II I | | | | | || | | | || | 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

II : I I I II I I I I II I : I : I I I I II I I I I I I II I I M 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSE 285 

III I I I I I I I II II I I I I I : : I : : I I I : I : I : I : : I : I I I I I || I 

Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 283 

Qy 286 MGS S FS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

I I I I I : III I I I : : I I : I I : I I : : I I : I : II : I I I I I I I I : I I , I I 
Db 284 MGSSFNGSPKGESAMLVENTKEEVIVRSKDKED-LVCSAALHNPQESPATLTKWKEDGV 342 

Qy 34 6 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 4 04 

: I I I I I I I : : : I I I : I I I I I I II I I I : I I I I I : I I I : I I I : I : I I 

Db 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 398 

Qy 4 05 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 4 64 

I I I I I I I I I I I I I I I I I I I :: I I I I I I : I I I I I I I I I : : I I II I I 
Db 399 KVDKKCFEDSLEQKGHGKDSESRNENASFPRTPELVKDGSRAYITCDSFS-SATESTAAN 457 

Qy 4 65 IFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYWTDNLTKV 52 4 

I I I : I I I I I I I I I I I I I I I : I I I I : I I I II MINI II III I I I I I I I I : I I 
Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

Qy 525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

M III I I I I I I II II I I I I I I I I I I I I I I I I I I I I : I I I II I I : I I I : I I I I I I I 
Db 517 TEAVVATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESI YPTAQLCP 576 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I I II : I I I I I I I I I I I I I I I I I I I : I I I I I I 111:1111 I I : I : II MINI 
Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 703 

11111111:11 I I I I I II I I : III II I I I I I I I I I I I I I I I I I I I I : I : I I : I 
Db 637 PYEEAMSVALKTSDSKEEIKEPESFNAAAQE^VEAPYISIACDLIKETKLSTEPSPEFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 763 

I I : I I I : I I I I I M : I I I I : I I I I I I I I I I I I I : I I I I : I I I I : I I I I I I I I : 
Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 7 64 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

: : : : : I I : I I I I I I I I I I I I I : : I III : | : | | : | | | I Mill : | | 

Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 883 

: I I I I I I III : : : I : I I I I I I I I I I II I I I I I : I : I II : I I II I II I : I I 

Db 815 IYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKS 870 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

I I I I I I I I I I : I II I II I I III : I I : I I I : I : I I I I I I : I I I I 
Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL 928 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKT 1003 



: I I : : I I I I M I I I I I : I I I I I I I I I I I I : I : I I : I I I I I I I I I I I I I I I I I 
Db 929 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKT 987 

Qy 1004 GVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYL 1063 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 988 GWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYL 1047 

Qy 1064 ESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNG 1123 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 104 8 ESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVIJyiWVFTYVGALFNG 1107 

Qy 1124 LTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I II I I I I I I I I I I I I 
Db 1108 LTLLI LALI SLFS I PVI YERHQAQI DHYLGLANKS VKDAMAKIQAKI PGLKRKAE 1162 



RESULT 3 
Q8K3G8 

ID Q8K3G8 PRELIMINARY; PRT; 1163 AA. 

AC Q8K3G8; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-A. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/ c ; 

RA Jin W., Long M. , Li R., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-A protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY114152; AAM77068.1; 

DR MGD; MGI : 1915835 ; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1163 AA; 126691 MW; 6B5F362799417EA4 CRC64; 

Query Match 72.2%; Score 4277.5; DB 11; Length 1163; 

Best Local Similarity 73.8%; Pred. No. 6.2e-203; 

Matches 883; Conservative 108; Mismatches 154; Indels 51; Gaps 19; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111:111 I I I I I I I I I I I I I I I I I I I I I II I I : II : I I II :: I I II I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VPT AP AAGAP LMD FGND FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP A 119 

I I I I I I I I I t 1 I I II : I I : I I I I I I I I I I I I I II I I I I I II I : I I : I I 
Db 61 AAGLSAVPVP — PAA-APLLDFS S DS VPPAPRGPLPAAP PTAPERQP SWERS PAAS A 114 



Qy 



120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 



Db 115 PSLPPAAAVLPSKLPEDDEPPAR — P PAPAGAS PLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AVVXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

II : I I I I I I I I I I I I : I : II I I I I I I I I I I I I I I I I 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSE 285 

III I I I II I I I I I I I I I I I : : I : : I I I : I : I : I : : I : MINIM 

Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 283 

Qy 286 MGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

I I I II : I II II I : : I I : I I : II : M I : I : II : M III II M II I I I 
Db 284 MGS S FNGS PKGESAMLVENTKEEVI VRS KDKED-LVCSAALHNPQES PATLTKWKEDGV 342 

Qy 34 6 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 4 04 

MM I I I I : : M I I M I I II I II II I : I II I I : I I I M I I :|:|| 
Db 343 MSPEKTMDIFNEMKMSWAPWEEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 3 98 

Qy 4 05 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 464 

I II I I I I I I I I I M II I M I : : II I I M I M I I II I II I M I I I I I 
Db 399 KVDKKCFEDSLEQKSHGKDSESRNENASFPSTPELVKDGSRAYITCDSFT-SATESTAAN 457 

Qy 465 IFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

I II M I I MM I II I II I I M I I I Ml I II I I II M M III II II II I I M I 

Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

Qy 525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II III I I II I II I I II I I II I II I I I II II I II I I M I II II I M II M I I I I I I 

Db 517 TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 57 6 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I II I M I II II I II II I I M I I II Ml MM I I I M II I I I M : II II I I II 

Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 703 

I I I M II I M I I I I II I II I : II I II II I I II I II II II II II I MM MM 
Db 637 PYEEAMSVALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 763 

I I M I I : II I I II I M II I M II I I II I II I I I M I I I M I II M I I II I I I : 
Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 764 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

: : : : M I M II I I I II II II I : M III M : II M II I Mill Ml 
Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 883 

M I II II III : : M M I I II II I I II II I I I I MM II M I I II I II M I 

Db 815 IYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKS 870 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

I M I II I I II M II II I I I II I : II M I I : I : II I II I M I I I 
Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL 928 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKKT 1003 

M I : MINIM II II M I I I I I II II II M : MM I I I I I I II I I I II I I I 



Db 



92 9 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKT 987 



Qy 1004 GW-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

III I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I II I I I I I I I 
Db 988 GVVYFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHVNCTIKELRRLFL^ 1122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1048 LESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVT)SLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKWKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I : I I I I I I II I I II I I I I I I I I : I I I I I I I I II I I I I I I I I I I 
Db 1108 GLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAE 1163 



RESULT 
Q8BGK7 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



PRELIMINARY; 



PRT; 1046 AA. 



OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



Q8BGK7 
Q8BGK7; 

01-MAR-2003 (TrEMBLrel. 23, 
01-MAR-2003 (TrEMBLrel. 23, 
01-OCT-2003 (TrEMBLrel. 25, 
RTN4. 
RTN4. 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 

STRAIN=129/SvcJ7, and 129SvcJ7; 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 

Structures of Human and Mouse Nogo/Rtn-4 . " ; 

Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

[2] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
Oertle T., Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
[3] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7 ; 
Van der Putten Re- 
submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
[4] 

SEQUENCE FROM N.A. 

STRAIN=129SvcJ7; 

Van der Putten H., Mir A.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
EMBL; AY102280; AAM73502.1; 
EMBL; AY102286; AAM73507.1; -. 
MGD; MGI: 1915835; Rtn4 . 

GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
GO; GO: 0007399; P : neurogenesis ; IDA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 



DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64; 



Query Match 61.8%; Score 3658.5; DB 11; Length 1046; 

Best Local Similarity 69.5%; Pred. No. 2e-172; 

Matches 767; Conservative 108; Mismatches 148; Indels 81; Gaps 17; 

Qy 97 APPVAPERQ PSWDPS PVSSTVPAP SPLSAAAVSPSKLPED 136 

I II : I I : I II I I I I I I I II 
Db 2 AP P LAGGGQKGGAAS EAWVP S L FVGVS GS T CT AAKS LVP I PARS S RL S AA 51 

Qy 137 DEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLK 196 

: I : : I I I : I I : : : I I I I I I 

Db 52 RNETLFALPA ASEPVIPSSAE KIMDLK 78 

Qy 197 EQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVS 256 

I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I lll:M I I I I I : : I : : 
Db 7 9 EQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYLGNLSAVASTEGTIEETLN 138 

Qy 257 EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDE 316 

I I I : I : I : I : : I : III I I I I I I M II : III I I I : : I I : I I : I I : : I I : 

Db 139 EASRELPERATNPFWRESAEFSVLEYSEMGSSFNGSPKGESAMLVENTKEEVIVRSKDK 198 

Qy 317 EEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPF 37 6 

I : I I : II Ml I I I : I I I I I : I I I I I I I : : : I I I : I I I II I I I I I 
Db 199 ED-LVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMKMSWAPVREEYADFKPF 257 

Qy 377 ERVWEVKDSKEDS-DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPS 435 

I : I I I I I : I I I : I I I : I : II II I I I I I I I I I I I I I I I I I : : I I I 

Db 258 EQAWEVKDTYEGSRDVLAA RANMESKVDKKCFEDSLEQKGHGKDSESRNENASFPR 313 

Qy 436 TPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKN 495 

III : I I I I I I I I I : : II I I I I I I I : I II I I I I I I I I I I I I : I I I I : I I I 
Db 314 TPELVKDGSRAYITCDSFS-SATESTAANIFPVLEDHTSENKTDEKKIEERKAQIITEK- 371 

Qy 4 96 TSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGT 555 

II I I I I I I II III I I I I I I I I : I I I I III I I I I I I I I I I I I I II I I I I I III 

Db 372 TSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEACESELNEATGT 431 

Qy 556 KIAYETKMDLVQTSEVMQESLYPAAQLC PS FEES EAT PSPVLPDIVMEAPLNSAVP SAGA 615 

I I II II I : I I I I I I I : I I I : I I II I I I II I I : I I I I I I I I I I I I I I I I I I I : I I I I 

Db 432 KIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSTGA 491 

Qy 616 SVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENINAALQE 674 

II I I I : I II I I I : I : II II I II I I I I I I I I I : I I I I I I I I I I I : III II 

Db 4 92 SVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIKEPESFNAAAQE 551 

Qy 675 TEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 734 

I II I I I I II I I I I I I I I I I I : I : I I : I I I : I I I : I I I I I I I : I I I I : I I M I I I I 
Db 552 AEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSPESEPVDLFS 611 

Qy 735 DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDN 794 

11111:111 I : I I I I : I I I II I I I : : : : : : I I : I I I I I I I II I I I I : : I 
Db 612 DDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHKERLSASPQEVGKPYLESFQPNLHI 670 

Qy 795 TKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIID 854 

II I : I : 11:111 I I I I I I : I I : I I I I I I II I ::: I : I I I I I I I II II I I 



Db 



671 TKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFSDSSPIEIID 729 



Qy 855 EFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVE 914 

I I II : I : I I I : II I I I I I I : I I I I I I I I I I I I : I I I I I I I I III 

Db 730 EFPTFVSAKDDS PKEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKNTYPKDE 785 

Qy 915 EKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKE 974 

: II : I I I : I : III II I : I II I : I I : MINIM I I I I : I I I I I M I I 
Db 786 AHV— SDEFSKSRSSVSKVPLLLPNVSAL-ESQIEMGNIVKPKVLTKEAEEKLPSDTEKE 842 

Qy 975 DRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALL 1034 

III : I : I I : I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 843 DR5LTAVLSAELNKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALL 902 

Qy 1035 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRR 1094 

I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II II I II 
Db 903 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRR 962 

Qy 1095 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGL 1154 

I I I I I I I I I I I I I I I I I I I I I II M I I I I I II I I I II I I II I : I I I I I I I I II I I I II I I 
Db 963 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGL 1022 

Qy 1155 ANKNVKDAMAKI QAKI PGLKRKAE 1178 

I II : I II I I I I I I I I I I I I I I I I I 
Db 1023 ANKSVKDAMAKI QAKI PGLKRKAE 1046 



RESULT 5 
Q7TNB7 

ID Q7TNB7 PRELIMINARY; PRT; 720 AA. 

AC Q7TNB7; 

DT 01-OCT-2003 (TrEMBLrel , 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain ; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 



RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:168 99-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RA Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; BC056373; AAH56373.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 720 AA; 77435 MW; 8 0AB7 8 728F16EAB2 CRC64 ; 

Query Match 43.6%; Score 2580.5; DB 11; Length 720; 

Best Local Similarity 70.8%; Pred. No. 2.1e-119; 

Matches 527; Conservative 69; Mismatches 107; Indels 41; Gaps 13; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I !:: I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

II I I I I I I I I III llhll : I I I I I I I I I I I II I I I I I I I I I I : I I : I I 
Db 61 AAGLSAAPVP — PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II I I 1 I I 1 I I I I I I I I I I I I I I I II II I I I I II I I I I I I I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

I I : II I I I I II I I I I : I : I I I I I II I I I I I I I I I I I 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSE 285 

III I I I I I I I I I I I I I I I I : : I : : I I I : I : I : I : : I : I I I I I I I I 

Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 283 

Qy 286 MGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

Mill: III I I I : : I I : I I : I I : : I I : I : II : II II I I I I : I I I I I 
Db 284 MGSSFNGSPKGES7\MLVENTKEEVIVRSKDKED-LVCSAALHNPQESPATLTKVVKEDGV 342 

Qy 346 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNLES 404 

: I I I I I I I ::: I I I : I I I II II I I I I : I II II : I I I : I I I : I : I I 

Db 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 398 

Qy 405 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPT^ATESIATN 4 64 

II I I I I I I I I I I I I I I I I I : : I I I I I I : I I I I I I II I : : I I I I I I 

Db 399 KVDKKCFEDSLEQKGHGKDSESRNENASFPRTPELVKDGSRAYITCDSFS-SATESTAAN 457 

Qy 465 IFPLLEDPTSENXTDEKKIEEKKAQLVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

111:111 I I I I I I I II I I I : I I I I : I I I II I I I I I I II III I I I I I I I I : I I 
Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 



Qy 



525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 
II III I II I II I I I II I I I I I I I I I I I I I I I I I I I : II I I I II : I I I : I I I I I I I 



Db 



517 TEAWATMPEGLTPDLVQEACE5ELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 576 



Qy 585 SFEESEATPSPVLPDIWEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I : I I I I I I : i : I! I I I I I I 
Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 703 

I I I I I I II : I I I I I I I I I I I : III II I I I I I I I I I I I || I I I I I I I : I : | I : I 
Db 637 PYEEAMSVALKTSDSKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDS 727 

I I : I I I : MM -11:111:1 
Db 697 SEIAKFEKSVPDHCELVDDSSPES 720 



RESULT 6 
Q8K290 



ID Q8K290 PRELIMINARY; PRT; 639 AA. 

AC Q8K2 90; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC032192; AAH32192.1; -. 

DR MGD; MGI : 19158 35; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 309A19DA37603F11 CRC64; 



Query Match 42.7%; Score 2527; DB 11; Length 639; 

Best Local Similarity 79.5%; Pred. No. 7.7e-117; 

Matches 515; Conservative 57; Mismatches 66; Indels 10; Gaps 6; 

Qy 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEA 591 

I I I I II M II I II II I I I I I I I I I II I I I I : I I I I I I I : I I I : I I I I I I I I M I : I I 
Db 1 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEA 60 

Qy 592 TPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMS 650 

I M II I I II I II I I I I I : I I I I II I I I : I I I I I I : I : I I II I I M I I I I I I I 
Db 61 TPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMS 120 

Qy 651 VSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

MM I I I II II II : Ml II II I I I I II I I I I M II II II : M I I : M M I I I 
Db 121 VALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFE 180 



Qy 711 QPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENK 770 

: I I I I I I I : I I I I : I I 11 I I I M I I I I : I I I I : I t I I : I I I I I I I I : : : : : : I 
Db 181 KSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHK 239 

Qy 771 EKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDL 830 

I : I I I I I I I I I I I I I :: I III : I : I I : I I I I I I I I I : I I : I I I I I I 

Db 240 ERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDL 298 

Qy 831 FISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPD 890 

III ::: I : I I I I II I I I I I I I I I I I : I : I II : I I I II I I I : I I I I I I 

Db 299 LSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKSEIANVQS 354 

Qy 891 GAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEI 950 

I I I I I I : I I I I II I I III : I I : I I I : I : I I I I I I : I I I I : I I : 
Db 355 GANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL-ESQIEM 411 

Qy 951 ESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGAS 1010 

: I I I I I I I I I I I : I I I I I I I I I I I I : I : I I : I I I I I I I I I I I I I I I I I I I I I I I I 
Db 412 GNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGWFGAS 471 

Qy 1011 LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 1070 

I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 472 LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 531 

Qy 1071 EELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTWGALFNGLTLLIIA 1130 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I 
Db 532 EELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 591 

Qy 1131 LISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I II I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I M I I I I I I I I I I 
Db 592 LISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQ7VKIPGLKRKAE 639 



RESULT 7 
Q80W95 

ID Q80W95 PRELIMINARY; PRT; 578 AA. 

AC Q8 0W95; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-A (Fragment) . 

GN NOGO-A. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Tozaki H., Hirata T.; 

RT "The partial sequence of mouse nogo-A cDNA clone#4109 . " ; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AB073672; BAC75974.1; 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 



FT NON_TER 1 1 

SQ SEQUENCE 578 AA; 63696 MW; 832670C17 1E4AC61 CRC64; 

Query Match 37.7%; Score 2233; DB 11; Length 578; 

Best Local Similarity 78.4%; Pred. No. 2.2e-102; 

Matches 460; Conservative 52; Mismatches 65; Indels 10; Gaps 6; 

Qy 593 PSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMSV 651 

I I I I I I I II I I I I I I I : I I I I I I I I I : I I I I I I : I : II I I I I I I I I I I I I I I 
Db 1 PSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSV 60 

Qy 652 SLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQ 711 

: I I I I I I I I I I I : III II I I I II I I I I I I I I I I I I I II : I I I : I I I : I I I : 
Db 61 ALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEK 120 

Qy 712 PVPDHSELVEDSSPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETS FESMI EYENKE 771 

I I I I I I I : I I I I : I I I I I I I I I II I I : I I I I : I I I I : I I I I I I I I : : : : : : I I 
Db 121 SVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHKE 179 

Qy 772 KLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLF 831 

: I II I I I I I I I I I I :: I II I : I : I I : II I I I I I I I : I I : I I I M I 

Db 180 RLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDLL 238 

Qy 832 ISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDG 891 

III ::: I : I I I I I I I I I I I I I I I M : I : I II : I I I I I II I : I I II II I 

Db 239 SSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKSEIANVQSG 294 

Qy 892 AGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIE 951 

I II I I : I I I I I I I I III : I I : I I I : I : I I I I I : I I I I : I I : 
Db 295 ANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVFKVPLLLPNVSAL-ESQIEMG 351 

Qy 952 SIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASL 1011 

: I I I I I I I I I I I : I I II I I I I I I I I : I : I I : I I I I I I I I I II I I I I I I I I I I I I I I 
Db 352 NIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASL 411 

Qy 1012 FLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 1071 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I II I I I I I I I II I I I I I I I I I I II I I 
Db 412 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 471 

Qy 1072 ELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILAL 1131 

I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I 
Db 472 ELVQKYSNSALGHWSTIKELRRLFLVT)DLVDSLKFAVLMWVFTYVGALFNGLTLLILAL 531 

Qy 1132 ISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I I 
Db 532 I SLFSI PVI YERHQAQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAE 578 



RESULT 8 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA. 

AC Q96B16; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein (RTN4) . 

GN RTN4 . 



OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia ; Eutheria ; Primates ; Catarrhini ; Hominidae ; Homo . 

OX NCBI__TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Kidney; 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T . , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22376540; PubMed=12488097 ; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL; BC016165; AAH16165.1; 

DR EMBL; AY102285; AAM64242.1; 

DR EMBL; AY102278; AAM64247.1; -. 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64; 

Query Match 25.8%; Score 1531; DB 4; Length 392; 

Best Local Similarity 32.4%; Pred. No. 6.1e-68; 

Matches 382; Conservative 7; Mismatches 3; Indels 786; Gaps 4; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I M I I I I I I I I M I II I I I I II I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPTAP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 AGL S AAP VP TAP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



Qy 



181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 



I I I : I 

Db 181 SSGSV 185 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPK7VESAV 300 

Db 186 185 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

Db 186 185 

Qy 3 61 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 42 0 

Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

I : I I 

Db 186 DETLF 190 

Qy 481 KKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 

Db 191 190 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 191 190 

Qy 601 VMEAP LN S AVP SAGAS VI Q P S S S P LEAS S VN YE S I KHE P EN P P P YE EAMS VS LKVS G I KE 660 

I : I : I 

Db 191 ALPAA 195 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Db 196 195 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLS7VLPPEG 780 

I I I I 

Db 196 SEPV 199 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

Db 200 199 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 200 199 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Db 200 199 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

: :: I II I I I I I I I I I I I I I I I I II I I I I I I I II 
Db 2 00 1 RS SAWDLLYWRDI KKTGWFGAS LFLLLSLTVF 234 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVI QAIQKSDEGHPFRAYLESEVAI SEELVQKYSNS 1080 

M I I I II I I I II I I I I I I I II I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | || | | | I I I I 



Db 235 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 294 

Qy 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

I I I I I II I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I II I I 
Db 295 ALGHWCTIKELRRLFLVDDLVDSLKFAVIJVI^ 354 

Qy 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I 
Db 355 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 392 

RESULT 9 
Q8BHF5 



ID Q8BHF5 PRELIMINARY; PRT; 375 AA. 

AC Q8BHF5; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=1009 0; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T . , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN— 12 9/ Svc J7 , and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/ DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H., Mir A. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY102282; AAM73504.1; 

DR EMBL; AY102286; AAM73509.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 375 AA; 40300 MW; 2 3D9EB1 9BE67 1AE6 CRC64; 



Query Match 



20.3%; Score 1201.5; DB 11; Length 375; 



Best Local Similarity 28.6%; Pred. No. l.le-51; 

Matches 337; Conservative 17; Mismatches 20; Indels 805; Gaps 11 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I I :: I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VP T AP AAGAP LMD FGND FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VP A 119 

I I I I I I I I I I III llhll : I I I I I I I I I I I I I I I I I I I I I I I : II : I I 

Db 61 AAGLSAAPVP — PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II I I II I M I I I I I I I I I I II II I I I I I I II I I I I I I II I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAP P S T PAAP KRR 163 

Qy 180 GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLG 239 
I I 

Db 164 GS 165 

Qy 240 NLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESA 299 

Db 166 165 

Qy 300 VI VANPREEI IVKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS S EKAKDS FNEKR 359 

Db 166 165 

Qy 360 VAVEAPMREE YAD FKP FERVWEVKDS KEDS DMLAAGGKI ESNLES KVDKKCFAD S LEQTN 419 

Db 166 165 

Qy 420 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

I I : I I 

Db 166 GSVDETLF 173 

Qy 4 80 EKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYWTDNLTKWEEWANMPEGLTPD 539 

Db 174 173 

Qy 54 0 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 

Db 174 173 

Qy 600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

I : I : I 

Db 174 ALPAA 178 

Qy 660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

Db 179 178 

Qy 720 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

I I I I 

Db 17 9 SEPV 182 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 



Db 



183 



182 



Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 

Db 183 182 

Qy 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 

Db 183 182 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTV 1019 

: I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I 

Db 183 IPS SAWDLLYWRDIKKTGWFGASLFLLLSLTV 216 

Qy 1020 FSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 1079 

I I I I I I I I I I I I I I I I 1 I I I 1 I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I II 
Db 217 FSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 276 

Qy 1080 SALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSVPV 1139 

I I I I I I I I I I II I I 1 I I I I I II I I I i I I II I I I I I I I I I I I I I I I I I I I I II I I I I : I I 
Db 277 SALGHWSTIKELRRLFLVDDLVDSLKFAVmWFTWGALFNGLTLL 336 

Qy 1140 I YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I I I I I I I I I I I I II II : I I I I II I I I I I I I I I I I I I I 
Db 337 I YERHQAQI DHYLGLANKSVKDAMAKIQAKI PGLKRKAE 375 



RESULT 10 
Q8BH78 

ID Q8BH7 8 PRELIMINARY; PRT; 356 AA. 

AC Q8BH7 8; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., van der Putten H . , Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12 9/SvcJ7; 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 



RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H., Mir A.; 

RL Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102281; AAM73503.1; -. 

DR EMBL; AY102286; AAM73508.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA9630B56 CRC64; 

Query Match 19.6%; Score 1163; DB 11; Length 356; 

Best Local Similarity 27.7%; Pred. No. 8e-50; 

Matches 326; Conservative 12; Mismatches 17; Indels 824; Gaps 8; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111:111 I I I I I I I I I 11 I I I I I I I I I I II I I : I I : I I I I :: I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RGP L P AAP P VAP ERQP SWD P S P VS S T VPA 119 

I I I I I I I I I I III llhll : I II I II I I I M II I I I I I I I I I I : II : I I 

Db 61 AAGL S AAP VP — PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRR 179 

II I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 18 0 GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLG 239 



Db 164 GS 165 

Qy 240 NLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESA 299 

Db 166 165 

Qy 300 VIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKR 359 

Db 166 165 

Qy 360 VAVEAPMREEYADFKPFERWEWDSKEDSDMIAAGGKIESNLESKVDKKCFADSLEQTN 419 

Db 166 165 

Qy 420 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

Db 166 165 

Qy 480 EKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEVV'ANMPEGLTPD 539 

Db 166 165 

Qy 540 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 



Db 



166 



165 



Qy 600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

Db 166 165 

Qy 660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

Db 166 165 

Qy 720 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

Db 166 165 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

Db 166 165 

Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 

Db 166 GSV 168 

Qy 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 

Db 169 168 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTV 1019 

I M I I I I I I I I I II I I I I I I I I I I I II I I 
Db 169 WDLLYWRDIKKTGWFGASLFLLLSLTV 197 

Qy 1020 FS I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGHP FRAYLE S EVAI S EELVQK YSN 1079 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I 
Db 198 FSIVSVTAYIAL7VLLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 257 

Qy 1080 SALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLIIAilSLFSVPV 1139 

I I I I I II I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I II II I I I I I I I II I I I I II I I : I I 
Db 258 SALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG7VLFNGLTLLILALISLFSIPV 317 

Qy 1140 I YERHQAQI DH YLGLANKNVKDAMAKI QAKI P GLKRKAE 1178 

I I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I 
Db 318 I YERHQAQ I DH YLGLANKS VKDAMAKI QAKI PGLKRKAE 356 



RESULT 11 
Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT; 357 AA. 

AC Q8K3G7; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-B. 

GN RTN4 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 



RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W., Li R. , Long M. , Shen J., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY114153; AAM77069.1; 

DR MGD; MGI : 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO:0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64; 

Query Match 19.5%; Score 1152.5; DB 11; Length 357; 

Best Local Similarity 27.6%; Pred. No. 2.6e-49; 

Matches 326; Conservative 12; Mismatches 17; Indels 825; Gaps 9 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111 = 111 I I I II I I I I I I I I I I I I I I I I I I I I : I I : I I M : : I I II I I II I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VP T AP AAGAP LMD FGN D EVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP A 119 

I I I I II I I I I III I I I : I I : I I I I I I I I I I I I I I! I I I I I I I I : II : I I 

Db 61 AAGL S AAP VP — PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLG 239 
I I 

Db 164 GS 165 

Qy 240 NLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESA 299 

Db 166 165 

Qy 300 VIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKR 359 

Db 166 165 

Qy 360 VAVEAPMREE YAD FKP FERVWEVKDS KEDS DMLAAGGKI ESNLES KVDKKC FADS LEQTN 419 

Db 166 165 

Qy 420 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

Db 166 165 

Qy 480 EKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPD 539 

Db 166 165 

Qy 540 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS FEES EAT PSPVLPD 599 

Db 166 165 



Qy 600 IVMEAPLNSAVPSAGASVIQPSSS PLEAS S WYES I KHEPENPPPYEEAMSVS.LKVSGIK 659 

Db 166 165 

Qy 660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

Db 166 165 

Qy 72 0 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

Db 166 165 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

Db 166 165 

Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 

I t : 

Db 166 GSV 168 

Qy 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 

Db 169 168 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGW-FGASLFLLLSLT 101E 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ni Ml III 

Db 169 WDLLYWRDIKKTGWYFGASLFLLLSLT 197 

Qy 1019 VFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYS 1076 

I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I 
Db 198 VFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYS 257 

Qy 107 9 NSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTWGALFNGLTLLILALISLFSVP 1136 

I I I I I I II I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I : I 
Db 258 NSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIP 317 

Qy 1139 VI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I : I I I I I I II I I I I I I I II I I I 
Db 318 VI YERHQAQ I DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 357 



RESULT 12 
Q7YRW9 

ID Q7YRW9 PRELIMINARY; PRT; 184 AA. 

AC Q7YRW9; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4w (Fragment) . 

GN RTN4 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 



RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288 ; 

RA Oertle T . , Klinger M. , Stuermer C.A. , Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164744; AAP47319.1; 

FT NONJTER 1 1 

SQ SEQUENCE 184 AA; 20671 MW; DE990E03BBAF84A1 CRC64; 



Query Match 15.2%; Score 901; DB 6; Length 184; 

Best Local Similarity 98.9%; Pred. No. 2.8e-37; 

Matches 182; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 995 L YWRDI KKTGWFGAS LFLLLS LTVFS I VS VTAYI ALALLSVT I S FRI YKGVI QAI QKS D 1054 

I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 1 L YWRDI KKTGVVFGXXLFLLLS LTVFS I VSWAYI ALALLSVT IS FRI YKGVI QAI QKSD 60 

Qy 1055 EGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVT)SLKFAVLMWVF 1114 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I || 
Db 61 EGHPFRAYLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVF 120 

Qy 1115 TWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 1174 

I I I I I I I I II I I I II I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I I II II I I I I I II 
Db 121 TWGALFNGLTLLIIALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 180 

Qy 1175 RKAE 1178 

I I I I 

Db 181 RKAE 184 



RESULT 13 
Q7T224 

ID Q7T224 PRELIMINARY; PRT; 199 AA. 

AC Q7T224; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4-C . 

GN RTN4 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBIJTaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=128322 88 ; 

RA Oertle T., Klinger M. , Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164737; AAP47312.1; -. 

SQ SEQUENCE 199 AA; 22293 MW; 07CF4E4EF2723251 CRC64; 



Query Match 15.0%; Score 887; DB 13; Length 199; 

Best Local Similarity 88.2%; Pred. No. 1.6e-36; 



Matches 180; Conservative 



7; Mismatches 11; Indels 6; Gaps 



1; 



Qy 975 DRSPSAIFSADLGKTS VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALL 1034 

I i I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I | I | I I I 

Db 2 DSQPSG WKDKVVDLLYWRDI KKTGVVFGAS LFLLLSLTVFS I VS VTAYI ALALL 55 

Qy 1035 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRR 1094 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I : I : I I I I : I I I : | I : I I I I 
Db 56 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESDVAVSEDLIQKYSSWLGHINGTVKELRR 115 

Qy 1095 LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGL 1154 

! I M I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | 
Db 116 LFLVI)DLvI)SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGL 175 

Qy 1155 ANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I II I I I II I II I I II I I I I I 
Db 176 VNKNVKDAMAKIQAKIPGLKRKTE 199 



RESULT 14 
Q9GM33 

ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=954 1 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Brain; 

RA Osada N., Hida M. , Kusuda J., Tanuma R. , Iseki K., Hirai M. , Terao K. , 

RA Suzuki Y., Sugano S., Hashimoto K. ; 

RT "Isolation of full-length cDNA clones from macaque brain cDNA 

RT libraries."; 

RL Submitted (OCT-2 000) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AB049853; BAB16739.1; -. 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 179 AA; 19949 MW; 5F8CD4 383FEE9E02 CRC64; 

Query Match 14.6%; Score 867; DB 6; Length 179; 

Best Local Similarity 98.3%; Pred. No. 1.3e-35; 

Matches 176; Conservative 3; Mismatches 0; Indels 0; Gaps 0; 

Qy 1000 IKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1059 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I : I I I I I I I I I I I I I II 
Db 1 MKKTGWFGASLFLLLSLTVFSIVS VTAYI ALALL SVT IS FRIYRGVIQAIQKS DEGHPF 60 

Qy 1060 RAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 1119 



Db 61 RAYLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLRFAVIjyfWVFTYVGA 120 

Qy 1120 LFNGLT LLI LAL I S LFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI P GLKRKAE 1178 

I M I I I I I I I I I I I I I I I I I I I I I I II M I I I | | | | | M I I I I I I I I I I I I I I I I I I I I 
Db 121 LFNGLTLL I IALIS LFS VP VI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 179 



RESULT 15 
Q90638 

ID Q90638 PRELIMINARY; PRT; 760 AA. 

AC Q90638; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE ChS-Rex-b (RTN1-A) . 

GN RTN1. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Brain; 

RX MEDLINE=96386034; PubMed=87 93864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

RN [2] 

RP SEQUENCE FROM N . A. 

RC TISSUE-Brain; 

RX MEDLINE=97183663; 

RA Ninkina N.N., Baka I.D., Buchman V.L.; 

RT "Rat and chicken s-rex/NSP mRNA: nucleotide sequence of main 

RT transcripts and expression of splice variants in rat tissues."; 

RL Gene 184:205-210(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12 8322 8 8 ; 

RA Oertle T., Klinger M. , Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; U17606; AAC60075.1; -. 

DR EMBL; AY164724; AAP47301.1; 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 760 AA; 82502 MW; 465C1B429F799D5C CRC64; 

Query Match 13.3%; Score 788.5; DB 13; Length 760; 

Best Local Similarity 27.7%; Pred. No. 7e-31; 

Matches 296; Conservative 115; Mismatches 315; Indels 341; Gaps 38; 



Qy 143 PPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRGSSGAVVXXXXKIMDLKEQPGNT 202 

III : : I I I II I III : | 

Db 4 PPDPQDLLLAGTAER-W AAAGAD E YAAGAAL RD GD GA QQREQL 45 

Qy 203 I SAGQEDFPSVLLETAASXPSLS PLSAAS FKEHEYLGNLSTVLPTEGTLQENVSEASKEV 262 

: I I : I I II I I : : I I : I : : : I 

Db 4 6 AFGSAREHPPVAMATA S PGVTAS SRLFDY GS S SAN GADS S FYT 88 

Qy 263 SEKAKTLLID RDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDE 316 

I I : I I I I I : : I : I I I | 
Db 89 S LISDVHYTTPRDNTYFTGV-YQQENSPIPCSGSTEGFNALGHPVQDV TGFE 139 

Qy 317 EEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPF 376 

II I I : : I I | : : | | : I : 

Db 14 0 SRGLFS LDSGIEMT PAESA EVDKSLTDPMKVEGYKYMDI 178 

Qy 377 ERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDD — TSFP 434 

11:1 I I : I I I : I : 

Db 17 9 SRPEEIK YQEKHDPDSEDESPDLIDEYR 2 06 

Qy 435 STPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEK 494 

III II I I : I : I I : I I I : : : I : M 

Db 207 GTPIG SGH — AAEPQRTTASEAIKA PKEQDPLE DKS FRDQHNASWTAP 253 

Qy 495 NTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANM PE GLTP 538 

III II : | : | | | : | | I | 

Db 254 VKITLTETPGAREATSKEASVTQPKSGLKPSHEWPTVMVSEPEDDSPGSVTPPSSGTEP 313 

Qy 539 DLVQEACESELNEWGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS FEES EAT PSPVLP 598 

: : I : I II : I I I I I II I I : I II 

Db 314 SGSESQGKGSLSEDELISAIKEAKGFSFETSEVQQS PAV— — SAEKQEQKMKPGRP 366 

Qy 599 DI VMEAPLNSAVPSA — GASVIQ- PS S S PLEAS SV NYESIKHEPENPPPYEEAMSVS 652 

: : II :: II III: I III I I I : I III II 

Db 367 — AVPSPLDNEAS SAESGDSEIELVSEDPLAAEEVLHSNYMTFSH-IGGPPP — SPASPS 421 

Qy 653 LKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 712 

: : I : : I I : | : : | : | | : :.| : | I II 

Db 422 IQYSILREERE AELDSELIIESCD ASSASEESP KREQD 459 

Qy 713 VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 772 

I :| I ::| I I I : 

Db 460 SPLMKPMVMD IIKE ENSSR 478 

Qy 773 LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 832 

I I I 

Db 479 ASASDYE 4 85 

Qy 833 SKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 892 

: I I I I : : I I III 
Db 486 ASKTTE-SRMNRE N LADS A 503 

Qy 893 GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIES 952 

I : : I I I : III::: 
Db 504 SYLKSS FVAPKVSSE PPTSAVSTEELKERI 533 



Qy 


953 


IVKPKVLEKEAEKKLPSDTEKEDRSPSAI-FSADLGKTSWDLLYWRDIKKTGWFGASL 
1 : 1 : 1 : 1 : INI: II : : 1 I 1 1 1 1 1 1 : 1 1 : 1 1 | : I 
ILKKPIEETVWQSKVSSKDSGKRSPLALPLLPFLNKQKAINLLYWRDIKQTGIVFGSLL 


1011 


Db 


534 


593 


Qy 


1012 


FLLLSLTVFSIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISE 
M III M M M 1 1 : 1 1 1 II II 1 1 II 1 1 1 : 1 1 : M : | M 1 II M 1 1 : 1 : : 1 : 
LLLFSLTQFSVVSWAYLALAGLSATISFRI YKSVLQAVQKTDEGHPFKAYLDMEMNLSQ 


1071 


Db 


594 


653 


Qy 


1072 


ELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILAL 
: MIM: Ml 1 : 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 M 1 1 1 1 1 1 M M : 
DQIQKYTDCLQLYVNSTVKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLIMAV 


1131 


Db 


654 


713 


Qy 


1132 


ISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 117 8 
MM:MI:|::MMM MM :: MIIMIMI Mill 
VSMFTLPWYDKYQAQIDQYLGLVRTHINTWAKIQAKIPGAKRKAE 760 




Db 


714 





Search completed: September 3, 2004, 16:08:53 
Job time : 82.0403 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 



September 3, 2004, 15:58:54 ; Search time 15.6624 Seconds 

(without alignments) 
3916.307 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database 



SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


5815 


98. 


2 


1192 


1 


RTN4_HUMAN 


Q9nqc3 


homo sapien 


2 


4296.5 


72. 


5 


1163 


1 


RTN4_RAT 


Q9jkll 


rattus norv 


3 


915 


15. 


4 


199 


1 


RTN4_MOUSE 


Q99p72 


mus rausculu 


4 


791 


13. 


4 


111 


1 


RTN1_RAT 


Q64548 


rattus norv 


5 


787.5 


13. 


3 


116 


1 


RTN INHUMAN 


Q16799 


homo sapien 


6 


626 


10. 


6 


237 


1 


RTN3_MOUSE 


Q9es97 


mus musculu 


7 


625.5 


10. 


6 


236 


1 


RTN3_HUMAN 


095197 


homo sapien 


8 


514 


8. 


7 


545 


1 


RTN2_HUMAN 


075298 


homo sapien 


9 


500 


8. 


4 


471 


1 


RTN2_MOUSE 


070622 


mus rausculu 


10 


324.5 


5. 


5 


865 


1 


CPN_DR0ME 


Q02910 


drosophila 


11 


320 


5. 


4 


5147 


1 


PCLO_HUMAN 


Q9y6v0 


homo sapien 


12 


315 


5. 


3 


5038 


1 


PCL0_M0USE 


Q9qyx7 


mus musculu 


13 


299 


5. 


0 


3924 


1 


ANK2_HUMAN 


Q01484 


homo sapien 


14 


296.5 


5. 


0 


2468 


1 


MAPB_HUMAN 


P46821 


homo sapien 


15 


289 


4. 


9 


2459 


1 


MAPB_RAT 


P15205 


rattus norv 


16 


289 


4. 


9 


2464 


1 


MAPB_MOUSE 


P14873 


mus musculu 


17 


287 


4. 


8 


5120 


1 


PCLO_CHICK 


Q9pu36 


gallus gall 



18 


286. 5 


4 , 


. 8 


3644 


1 


MINT 


_MOUSE 


Q62504 


mus inusculu 


19 


276 


4 , 


.7 


5085 


1 


PCLO_ 


_RAT 


Q9jks6 


rattus norv 


20 


273.5 


4, 


.6 


3664 


1 


MINT_ 


_HUMAN 


Q96t58 


homo sapien 


21 


264.5 


4, 


.5 


8545 


1 


ANC1_ 


CAEEL 


Q9n4m4 


caenorhabdi 


22 


263.5 


4, 


.4 


3396 


1 


PGCV_ 


_HUMAN 


P13611 


homo sapien 


23 


262.5 


4 , 


.4 


6632 


1 


UN8 9_ 


_CAEEL 


001761 


caenorhabdi 


24 


260.5 


4 , 


.4 


3381 


1 


PGCV_ 


BOVIN 


P81282 


bos taurus 


25 


259.5 


4. 


.4 


1828 


1 


MAP2_ 


_MOUSE 


P20357 


mus musculu 


26 


258.5 


4 . 


.4 


2805 


1 


MAPA 


_HUMAN 


P78559 


homo sapien 


27 


258 


4 , 


.4 


1616 


1 


P2 00_ 


_MYCGE 


Q49429 


mycoplasma 


28 


257 . 5 


4 , 


.3 


1781 


1 


AK12_ 


HUMAN 


Q02952 


homo sapien 


29 


257 . 5 


4 , 


.3 


2738 


1 


PGCV_ 


~RAT 


Q9erb4 


rattus norv 


30 


253 


4, 


,3 


1189 


1 


YJH6_ 


_ YEAST 


P47035 


saccharomyc 


31 


252 


4 , 


.3 


4377 


1 


ANK3_ 


_HUMAN 


Q12955 


homo sapien 


32 


250.5 


4 , 


.2 


1861 


1 


MAP2_ 


"rat 


P15146 


rattus norv 


33 


249.5 


4 , 


.2 


1827 


1 


MAP2_ 


[human 


P11137 


homo sapien 


34 


248 


4. 


.2 


3421 


1 


TEGU 


HSVEB 


P28955 


equine herp 


35 


244 


4. 


.1 


1362 


1 


BRD4_ 


[human 


060885 


homo sapien 


36 


242 


4. 


.1 


1790 


1 


USOl 


"yeast 


P25386 


saccharomyc 


37 


239 


4 . 


. 0 


1744 


1 


TANA_ 


"xenla 


Q01550 


xenopus lae 


38 


238.5 


4. 


.0 


1411 


1 


TCOF 


"human 


Q13428 


homo sapien 


39 


236.5 


4, 


.0 


3358 


1 


PGCV_ 


[mouse 


Q62059 


mus musculu 


40 


235.5 


4. 


.0 


1723 


1 


AIMl 


"human 


Q9y4kl 


homo sapien 


41 


233.5 


3. 


.9 


1087 


1 


NFH MOUSE 


P19246 


mus musculu 


42 


232 


3, 


.9 


1140 


1 


YM96 


_YEAST 


Q04893 


saccharomyc 


43 


231 


3. 


.9 


3256 


1 


KI67_ 


_HUMAN 


P46013 


homo sapien 


44 


229.5 


3, 


.9 


1367 


1 


AMYH_ 


[yeast 


P08640 


saccharomyc 


45 


228.5 


3. 


.9 


1306 


1 


MSB2_ 


[yeast 


P32334 


saccharomyc 



ALIGNMENTS 



RESULT 1 




RTN4_ 


_HUMAN 




ID 


RTN4 HUMAN STANDARD; PRT; 1192 AA. 




AC 


Q9NQC3; 094962; Q9BXG5 ; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; 


Q9Y2Y7; 


AC 


Q9Y5U6; 




DT 


28-FEB-2003 (Rel. 41, Created) 




DT 


28-FEB-2003 (Rel. 41, Last sequence update) 




DT 


15-MAR-2004 (Rel. 43, Last annotation update) 




DE 


Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) 


( Foocen) 


DE 


(Neuroendocrine-specif ic protein) (NSP) (Neuroendocrine 


specific 


DE 


protein C homolog) (RTN-x) (Reticulon 5) (My043 protein) 




GN 


RTN4 OR NOGO OR ASY OR KIAA088 6. 




OS 


Homo sapiens (Human) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 


OC 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 


OX 


NCBI TaxID=9606; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 




RX 


MEDLINE=20129242; PubMed=10667780 ; 




RA 


Prinjha R., Moore S.E., Vinson M. , Blake S., Morrow R. , 


Christie G. , 


RA 


Michalovich D., Simmons D.L., Walsh F.S.; 




RT 


"Inhibitor of neurite outgrowth in humans."; 




RL 


Nature 4 03:383-384(2000). 




RN 


[2] 





RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2). 

RC TISSUE=Brain; 

RX MEDLINE=21010696; PubMed=11126360; 

RA Tagami S., Eguchi Y., Kinoshita M. , Takeda M. , Tsujimoto Y. ; 

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on 

RT endoplasmic reticulum and reduces their anti-apoptotic activity."; 

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE=20237542; PubMed=10773680; 

RA Yang J., Yu L., Bi A.D., Zhao S.-Y.; 

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome 

RT 2pl4 — >2pl3 by radiation hybrid mapping."; 

RL Cytogenet. Cell Genet. 88:101-102(2000). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4) . 

RA Jin W. -L. , Ju G. ; 

RT "Developmentally-regulated alternative splicing in a novel Nogo-A."; 

RL Submitted (NOV-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Placenta, and Skeletal muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in human."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Fibroblast; 

RA Yutsudo M. ; 

RT "Isolation of a cell death-inducing gene."; 

RL Submitted (JUN-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RC TISSUE=Pituitary; 

RA Song H., Peng Y., Zhou J., Huang Q., Dai M. , Mao Y.M., Yu Y., Xu X., 

RA Luo B., Hu R,, Chen J.; 

RT "Human neuroendocrine-specif ic protein C (NSP) homolog gene."; 

RL Submitted (JUL-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y., 

RA Yu J., Han L.H. ; 

RT "Novel human cDNA clone with function of inhibiting cancer cell 

RT growth."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain; 

RX MEDLINE=99156230; PubMed=100484 85 ; 

RA Nagase T., Ishikawa K.-I., Suyama M. , Kikuno R. , Hirosawa M. , 

RA Miyajima N., Tanaka A. , Kotani H., Nomura N . , Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 



RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3). 

RC TISSUE=Brain, Ovary, Pancreas, Placenta, and Skeletal muscle; 

RX MEDLINE=22388257; PubMed=1247 7932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J. , Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y . , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences. "; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:168 99-16903(2002). 

RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RX MEDLINE=20499367; PubMed=11042 152 ; 

RA Zhang Q.-H., Ye M. , Wu X.-Y., Ren S.-X., Zhao M. , Zhao C.-J., Fu G., 

RA Shen Y., Fan H.-Y., Lu G. , Zhong M. , Xu X.-R., Han Z.-G., Zhang J.-W., 

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z.; 

RT "Cloning and functional analysis of cDNAs with open reading frames for 

RT 300 previously undefined genes expressed in CD34 + hematopoietic 

RT stem/progenitor cells."; 

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4). 

RC TISSUE=Brain; 

RA Mao Y.M., Xie Y., Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1) . 

RC TISSUE=Testis; 

RA Sha J.H., Zhou Z.M., Li J.M. ; 

RL Submitted (JAN-2001) to the EMBL/ GenBank/DDBJ databases. 

RN [14] 

RP TOPOLOGY. 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=106677 97 ; 

RA GrandPre T . , Nakamura F., Vartanian T., Strittmatter S.M.; 

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 4 03:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain; 

RX MEDLINE=21069055; PubMed=112 01742 ; 

RA Fournier A.E., Grandpre T., Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 



RT regeneration."; 

RL Nature 409:341-346(2001). 

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=11891768 ; 

RA Ng C.E.L., Tang B.L.; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration . " ; 

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults . 

CC Isoform 2 reduces the anti-apoptotic activity of Bcl-xl and Bcl-2 . 

CC This is likely consecutive to their change in subcellular 

CC location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=l; Synonyms=RTN 4A, Nogo-A, RTN-xL; 

CC IsoId=Q9NQC3-l; Sequence-Displayed; 

CC Name=2; Synonyms=RTN 4B, Nogo-B, RTN-xS, Foocen-M; 

CC IsoId=Q9NQC3-2; Sequence-VSP_005655 ; 

CC Name=3; Synonyms=RTN 4C, Nogo-C, Foocen-S; 

CC IsoId=Q9NQC3-3; Sequence=VSP_005652 , VSP_005653; 

CC Name=4 ; 

CC IsoId=Q9NQC3-4; Sequence=VSP_005654 ; 

CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain 
CC and testis and weakly in heart and skeletal muscle. Isoform 2 is 

CC widely expressed excepted for the liver. Isoform 3 is expressed in 

CC brain, skeletal muscle and adipocytes. Isoform 4 is testis- 

CC specific. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC -!- CAUTION: Ref.ll sequence differs from that shown due to 
CC frameshifts in positions 1149 and 1156. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ251383; CAB99248.1; 

DR EMBL; AJ251384; CAB99249.1; -. 

DR EMBL; AJ251385; CAB99250.1; -. 

DR EMBL; AB040462; BAB18927.1; 

DR EMBL; AB040463; BAB18928.1; -. 

DR EMBL; AF148537; AAG12176.1; 

DR EMBL; AF148538; AAG12177.1; 

DR EMBL; AF087901; AAG12205.1; -. 

DR EMBL; AF320999; AAG40878.1; -. 

DR EMBL; AF132047; AAD31021.1; 

DR EMBL; AF132048; AAD31022.1; 



DR EMBL; 

DR EMBL ; 

DR EMBL; 

DR EMBL; 

DR EMBL; 

DR EMBL; 

DR EMBL; 



AB015639; 
AF077050; 
AF177332; 
AB020693; 
BC001035; 
BC007109; 
BC014366; 



BAA83712.1 
AAD27783, 1 
AAG17976.1 
BAA74909.1 
AAH01035.1 
AAH07109. 1 
AAH14366.1 



Query Match 98.2%; Score 5815; DB 1; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.8e-215; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Qy 61 AGL S AAP VPT APAAGAP LMD FGN D FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 

i I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I | | I | I | M | | | | M I I I I I I I I I I I I 
Dfo 61 AGL S AAP VPTAP AAGAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I 1 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I || I I || I I I M | | M 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

I I I : I : : I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I 

Db 181 S S GS VDETL FALPAAS E PVI RS SAENMDLKEQP GNT I SAGQEDFP S VLLETAAS L P S L S P 240 



Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 28 6 

I I I I I I I I I I I I I I M II I I I I I I I 1 II I II I I I I I I I I I I I I I I I I I I I I | | | | | | | | | 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 34 6 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I II I I 
Db 301 GSS FSVS PKAESAVIVANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I I M II I I II I II I I I I I I M I I I I 1 | | | | | | | | | || | | | | | || | | | M | | | M | | | | | 
Db 361 S S EKAKDS FNEKRVAVEAPMREEYADFKPFERVWEVKDS KEDSDMLAAGGKI ESNLES KV 420 



Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I M I I I II I I I I I II I I I I I || I M I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 



Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

IN I I I I I I I I II I I I I I I M I II I II II I I I I I I I I I M I I I I I II I I I I || I I I I 

Db 4 81 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I II I I I I I I I I 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I M I II I I I I I II I I II I II I II I I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I I 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 



Qy 



647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 7 05 



Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 7 06 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I M I I II I I I I I I I I I I I I I I I I I 1 I I I I I I I I I | | | | | | | M I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 78 0 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I M I I II II I II I I I I II I I I I I II I I I I || I I I I I I I I I I I I I I || I M I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I M I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI QAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I II II I I I I II I I I I I I I I II I I I I I I I 
Db 1080 EVAISEELVQKYSNSTaGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II II I I I I I I I I I I 
Db 1140 LLII^ISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 

RESULT 2 
RTN4 RAT 



ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9; Q9WUF0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN. [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE. 

RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte; 

RX MEDLINE=99249816; PubMed=1023 1557 ; 

RA Morris N.J., Ross S.A. , Neveu J.M., Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family. 11 ; 



RL Biochim. Biophys . Acta 1450:68-7 6(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20129258; PubMed=10667796; 

RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M. , Schnell L . , 

RA Spillmann A.A. , Christ F. , Schwab M.E.; 

RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an 

RT antigen for monoclonal antibody IN-1."; 

RL Nature 403:434-439(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4) . 

RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle; 

RA Ito T . , Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants. "; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP FUNCTION. 

RX MEDLINE=22033691; PubMed=12037567 ; 

RA GrandPre T . , Li S., Strittmatter S.M.; 

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration."; 

RL Nature 417:547-551(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=l; Synonyms=Nogo-A, NI-220-250; 

CC IsoId=Q9JKll-l; Sequence=Displayed; 

CC Name=2; Synonyms=Nogo-B, Foocen-Ml; 

CC IsoId=Q9JKll-2; Sequence=VSP_005658 ; 

CC Name=3; Synonyms=Nogo-C, VP2 0; 

CC IsoId=Q9JKll-3; Sequence=VSP_005656, VSP_005657; 

CC Name=4; Synonyms=Foocen-M2 ; 

CC IsoId=Q9JKll-4; Sequence=VSP_005659; 

CC -!- TISSUE SPECIFICITY: Isoforms 1, 2 and 3 are present in optic 
CC nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 

CC present in dorsal root ganglion, sciatic nerve and PC12 cells 

CC after longer exposure. Isoforms 2 and 3 are detected in kidney, 

CC cartilage, skin, lung and spleen. Isoform 3 is expressed at high 

CC level in skeletal muscle. In adult animals isoform 1 is expressed 

CC mainly in the nervous system. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



EMBL; 
EMBL; 

EMBL; AJ242962; 

EMBL; AJ242963; 

EMBL; AF132045; AAD31019 

EMBL; AF132046; AAD31020 



AF051335; AAF01564. 
AJ242961; CAB71027. 

CAB71028 . 
CAB71029, 



GO; GO:0030176; C:integral to endoplasmic reticulum membrane; IDA. 
GO; GO:0005635; C: nuclear membrane; ISS. 
GO; GO:0005515; F:protein binding; ISS. 

GO; GO: 0019987; Prnegative regulation of anti-apoptosis ; ISS. 
GO; GO: 0030517; P:negative regulation of axon extension; ISS. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

Endoplasmic reticulum; Alternative splicing; Transmembrane. 



DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

DOMAIN 

DOMAIN 

DOMAIN 

DOMAIN 

VARSPLIC 

VARSPLIC 



VARSPLIC 

VARSPLIC 

CONFLICT 
SEQUENCE 



1 

990 
1011 
1105 
1126 
976 
33 
73 
140 
1 

965 



173 



192 



1130 



989 
1010 
1104 
1125 
1163 
1163 
46 
76 
145 
964 

975 



975 



975 



1131 



CYTOPLASMIC (Potential) . 
POTENTIAL. 

LUMENAL (Potential) . 
POTENTIAL. 

CYTOPLASMIC (Potential) . 

RETICULON. 

POLY-GLU. 

POLY-ALA. 

POLY- PRO. 

Missing (in isoform 3) . 
/FTId=VSP_005656. 

AVLSAELSKTS -> MDGQKKHWKDK (in isoform 
3) . 

/FTId=VSP_005657. 

Missing (in isoform 2) . 

/FTId=VSP_005658 . 

Missing (in isoform 4). 

/FTId=VSP_005659. 

MISSING (IN REF. 3; AAD31020) . 



1163 AA; 126386 MW; 8CB894B09E94F0B6 CRC64 ; 



Query Match 72.5%; Score 4296.5; DB 1; 

Best Local Similarity 74.0%; Pred. No. 2e-157; 
Matches 885; Conservative 104; Mismatches 156; Indels 



Length 1163 ; 

51; Gaps 



Db 



1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 
111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I : I I :: I I I I I I I I I I I I I 
1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 



19; 



58 



60 



Qy 

Db 

Qy 

Db 

Qy 

Db 



59 P AAGL S AAP VPT APAAGAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VP 118 

I I I I I I I I I I I I I I I I : I I : I I I I I I II I I I I I I I I I I I I I II : I I .: I 

61 PAAGLSAAAVP — P AAAAP L LDFSSDS VP PAP RGPL PAAP PAAP ERQPSWERSPAA P 115 

119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAP P S T PAAP K R 166 



179 



224 



RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 

Ml : I I I I I I I I I I I : I : I I I I I I I II I I I I I I I I I 

167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 22 6 



Qy 



225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 



I I I I ' I ' I I I " I • • • I • I ••III I I I I I I I I 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

QY 285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

I I I I I I I I I I I I :: I I : I I : I I : : I I : I : I I : I I Ml III 
Db 287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 340 

Qy 34 5 WSSEKAKDSFNEKRVAVEAPMREEY7VDFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

I II I I I I I I : :: I II : I I I I I I II I M : II I I I : I I I : M I : I : I 

Db 341 WS PEKTMDI FNEMQMS WAPVREEYADFKP FEQAWEVKDTYEGS RDVLAA RANVE 396 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 4 63 

I I I I : I I Mill: MM I : I I I I I I I I : I I I I I I M I I Mill 
Db 397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFEVAAQDSETDYVTTDNLTK 523 

I I I I I I I M I I I II II I I I M I II M II II I II I I I II I I I I I I I M I Ml 
Db 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

I I I I : I I M I I I I M I I I II I I I I I I II I II I II II M I I II II M I I M I II I I 
Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

I I M I : M I II II I I I I I II II I II : I II I I I I : I I I Mill I M M I I I II I I 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

I I I M I I M M I I II Mill: I II : I I II II I II I I II I I II I I II M : I II I : 
Db 635 P PYEEAMNVALKALGTKEGI KEPES FNAAVQETEAPYI SIACDLI KETKLSTEPS PDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

IIIMI I : II : I : I! I I I I I M I II I I M I I I II I : I I I Ml I I M II I II I I I 
Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 7 53 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : :: =1 = 111 I I I I II I I I I : : I MM : : : II : I I I M II I I I : I 

Db 754 TVAQHK- EERLSAS PQELGKPYLES FQPNLHSTKDA-ASNDI PTLTKKEKI S LQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

I : I I I I I I II I M : I : I I II I I I I I I I I I I II I MM II II I M I I II I M I 
Db 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

Qy 8 83 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

I II II M I II I II I I I I I I I I I : : : M M I M I : II : I : I I I 
Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I I : MM I I I I I I I MUM II: MM I I I I I I II II II I I I 

Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDR3LSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

I M I I I M I II I II M II I I M I II I I I II I I I I I II 1 I I I I I I M M II I II I I I I I I I 
Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

M I I I M M II II I I I M I I I I II I M II I II I II I II I I I I M I I II II I I I I I I I M 



Db 1048 L E S E VAI SEE L VQ K Y S N SAL GH VN S T I KE L RRL FL VD D LVD S L K FAVLMWVFT YVGAL FN 1107 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I M : I I II I I I I I I I I I II I I I I : I I I I II I I I I I I I I I I I I I : 
Db 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 



RESULT 3 
RTN4 MOUSE 



ID RTN4_MOUSE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3 ; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel; 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) . 

GN RTN4 OR NOGO. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=100 90; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-3T3-L1; TISSUE=Adipocyte ; 

RA Coulson A.C., Craggs P.D., Morris N.J.; 

RT "Mouse vp20/RTN4C cDNA . " ; 

RL Submitted (DEC-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Embryo ; 

RX MEDLINE-21085660; PubMed=112 17851; 

RA Kawai J., Shinagawa A., Shibata K. , Yoshino M. , Itoh M., Ishii Y. , 

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S., 

RA Aizawa K. , Izawa M. , Nishi K. , Kiyosawa H., Kondo S., Yamanaka I., 

RA Saito T . , Okazaki Y., Gojobori T . , Bono H., Kasukawa T . , Saito R. , 

RA Kadota K. , Matsuda H.A., Ashburner M. , Batalov S., Casavant T., 

RA Fleischmann W. , Gaasterland T., Gissi C, King B., Kochiwa H., 

RA Kuehl P., Lewis S., Matsuo Y. , Nikaido I., Pesole G . , Quackenbush J., 

RA Schriml L.M., Staubli F., Suzuki R. , Tomita M. , Wagner L., Washio T., 

RA Sakai K., Okido T., Furuno M. , Aono H., Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D . , Bojunga N., Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C, Fletcher C, Fujita M. , Gariboldi M. , 

RA Gustincich S., Hill D., Hofmann M. , Hume D.A., Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B., Ringwald M. , Rodriguez I., Sakamoto N., 

RA Sasaki H., Sato K. , Schoenbach C, Seya T., Shibata Y. , Storch K.-F., 

RA Suzuki H., Toyo-oka K. , Wang K.H., Weitz C, Whittaker C, Wilming L., 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y. , Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection . " ; 

RL Nature 4 09:685-690(2001). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



transmembrane domains (By similarity) . 
ALTERNATIVE PRODUCTS: 

Event=Alternative splicing; Named isof orms=l ; 

Comment=A number of isoforms may be produced; 
Name=l ; 

IsoId=Q99P72-l; Sequence=Displayed; 
SIMILARITY: Contains 1 reticulon domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AF326337; AAK08076.1; 

EMBL; AK003859; -; NOT_ANNOTATED_CDS . 

MGD; MGI : 1915835 ; Rtn4 . 



GO 
GO 
GO 
GO 
GO 
GO 
GO 

InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 



GO: 0005783; C : endoplasmic reticulum; IDA. 

GO: 0030176; C: integral to endoplasmic reticulum membrane; ISS. 

GO: 0005635; C:nuclear membrane; ISS. 

GO: 0005515; F:protein binding; ISS. 

GO: 0019987; P:negative regulation of anti-apoptosis ; ISS. 

GO: 0030517; P: negative regulation of axon extension; ISS. 

GO: 0007399; P : neurogenesis ; IDA. 



DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane. 


FT 


DOMAIN 


1 


25 


CYTOPLASMIC (Potential) . 


FT 


TRANSMEM 


26 


50 


POTENTIAL. 


FT 


DOMAIN 


51 


137 


LUMENAL (Potential) . 


FT 


TRANSMEM 


138 


162 


POTENTIAL. 


FT 


DOMAIN 


163 


199 


CYTOPLASMIC (Potential) . 


FT 


DOMAIN 


12 


199 


RETICULON. 


SQ 


SEQUENCE 


199 AA; 


22466 MW; 


07BE5D580059ED9C CRC64; 


Query Match 




15.4%; 


Score 915; DB 1; Length 199; 



Best Local Similarity 97.4%; 
Matches 186; Conservative 



Pred. No. 4.9e-29; 
2; Mismatches 3; 



Indels 



0; Gaps 



0; 



Qy 

Db 



988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 
I I I I I I I I I I I I I I I I M I I I I II I I I I II I M I I I I I I I I I I I I I I I I I I I I I I I I I 
9 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVI 68 



Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I II I I II I 
Db 69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 128 

Qy 1108 AVLMWFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGIANKNVKDAMAKIQ 1167 

I I I I I I I I II I I I I I I I II I I II I I I I I I : I I I I I I I I I I I I I M I I I I I : II I I I I I II 
Db 129 AVXMWFTWGAIjFNGLTLLIIAIjISLFSIPVIYERHQAQIDH 188 

Qy 1168 AKIPGLKRKAE 1178 

I I I I I I I I I I I 
Db 189 AKIPGLKRKAE 199 



RESULT 4 
RTN1 RAT 



ID RTN1_RAT STANDARD; PRT; 777 AA. 

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) (S-rex) . 

GN RTN1 OR NSP. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-B AND RTNl-S) . 

RC STRAIN=Wistar; TISSUE=Brain cortex; 

RX MEDLINE=96386034; PubMed=87 938 64 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane (By 

CC similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTNl-B; Synonyms=S-RexB; 

CC IsoId=Q64 54 8-l; Sequence=Displayed; 

CC Name=RTNl-S ; Synonyms=S-RexS ; 

CC IsoId=Q64548-2; Sequence=VSP_005647 , VSP_005648; 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN. 

CC EXPRESSION OF ISOFORM RTNl-B IS RESTRICTED TO PARTICULAR NEURONAL 

CC TYPES. 

CC DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE 

CC HINDBRAIN AND IN Ell IN THE FOREBRAIN. DURING THE NEXT 3 EMBRYONIC 

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT El 3 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U17604; AAC53046.1; 

DR EMBL; U17603; AAC53045.1; 



DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 



FT 


TRANSMEM 


604 


624 


POTENTIAL. 


FT 


TRANSMEM 


727 


747 


POTENTIAL. 


FT 


DOMAIN 


590 


111 


RETICULON. 


FT 


DOMAIN 


610 


613 


POLY-LEU. 


FT 


VARSPLIC 


1 


569 


Missing (in isoform RTNl-S) . 


FT 








/FTId-VSP_005647. 


FT 


VARSPLIC 


570 


589 


IPGPLGSDLVPPLPFFNKQK -> MQATADSTKMDCVWSNW 


FT 








KSQ (in isoform RTNl-S) . 


FT 








/FTId=VSP 005648. 


SQ 


SEQUENCE 


777 AA; 


83001 


MW; AF7479C50F28D0AC CRC64; 



Query Match 13.4%; Score 7 91; DB 1; Length 777; 

Best Local Similarity 26.6%; Pred. No. 1.4e-23; 

Matches 290; Conservative 129; Mismatches 308; Indels 364; Gaps 39 

Qy 134 PEDDEPPARPPPPPPASVSPQAE-PVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKI 192 

I: : I I I : I II II I I I : I I 
Db 5 PDLQDEPLSPANPGSQLFGGRGEGEEATPKGARPAQQDGEPAWGS — GAGAGWS 57 

QY 193 MDLKEQPGNTISAGQEDFPSVLLETA ASXP-SLSPLSAASFKEHE YLGNLSTV 244 

: : I 11:111 I : I : I I : : I : I I : I : 

Db 58 SRGLCSGPARSPPVAMETASTGVAAVPDALDHSSSPTLKDGEGACYTSLISDI 110 

Qy 245 L — PTE GTLQE NVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFS 291 

II III: I I : : I : : I : I I I I : I 

Db 111 CYPPREDSAYFTGILQKENGHITTSESPEELGTPGPS-LPEVPGTEPHGLLSSDSGIEMT 169 

Qy 292 VSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKA 351 

: I I : I : I : : 

Db 170 PAESTEVNKILADPLDQ 186 

Qy 352 KDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCF 411 

I : I : 111:111 
Db 187 MKAEACKYIDITRPQEAKGQEEQSPGL 213 

Qy 412 ADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATES — IATNIFPLL 469 

: I I : : I : : Ml: I I I : I I hi 
Db 214 EDKDLDFKDKDSEVSTKPEGVH APNQ P S PVEGKL I KDNL F 253 

Qy 470 EDPTSENXTDEKKIEEKKAQIVTE — KNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

I: I II I: : :M III: I I : : 
Db 254 EESTFAPYIDELSDEQHRMSLVTAPVKITLTEIGPPVMTATHET 297 

Qy 52 8 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFE 587 

: M Ml I I: I: 

Db 298 IPE KQDL CLKPSPDTVPTVT 317 

Qy 588 ESEA TPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSP LEASSVNYESIKH 637 

II : I I I I : I I I I : I | I I : : I I : 
Db 318 VSEPEDDSPGSVTP PSSGTEPSAAESQGKGSVSEDELIAAIKEAKGLSYET 368 

Qy 638 EPENPPPYEEAMSVSLKVSGIKEEIKE PENINAALQETEAPYI S IACDLI KET 690 



Db 369 -TESPRPVGQAAD RPKVKARSGLPTIPSSLDQEASSAESGDSEI — ELVSED 417 

Qy 691 KLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVM 750 

:::| I I I III II I I I ::: 

Db 418 PMASEDALP-SGYVSFGHVSGPPP SPAS PSIQYSILREEREAE 459 

QY 751 LVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK 810 

I I : I : I I : : : II I I : I I : : | | : : : 

Db 460 LDSELIIESCDASSASEESPKREQDSPPM — KPGV LD AIREETSSRATE 506 

Qy 811 EKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE IIDEFPTLISSKTDSF 867 

I : I I I : I I : I : I : : 

Db 507 ERAP SHQGPVEPDPILSFTPVTLQSRPE — 534 

Qy 868 SKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNG 927 

I III I : I I I I : I I :: 
Db 535 PSSGDGAPVPEPP KSQQQKPEEEAVSS SQSP 565 

Qy 928 SATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLG 987 

; I I : : I : | | : 
Db 566 AAT EIPGPLGSDLVPPLPFFN 586 

Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

I : I M I I II I I : I I : I I I : I II I I I | | : | | | I , : M I I I I M I I I II I : 
Db 587 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 646 

Qy 104 8 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I : I h I I I M I : I I I I I : : I : I : I I I : : : I I I : I I I I I I I I I I M I I I I I 

Db 647 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 7 06 

Qy 1108 AVLMWVFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

Mill: I I I II I I II I I I I : : I : : I : I : : I I : I : I I I I : I MM : : Mill 
Db 707 AVLNMLLTWGALFNGLTLLLMAWSMFTLPVVYVKHQAQVDQYLGLVRTHINTWAKIQ 766 

Qy 1168 AKI PGLKRKAE 1178 

Mill II II 
Db 767 AKIPGAKRHAE 777 



RESULT 5 
RTN1_HUMAN 

ID RTN 1_HUMAN STANDARD; PRT; 776 AA. 

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) . 

GN RTN1 OR NSP. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A; RTN1-B AND RTN1-C) . 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=93293865; PubMed=7 685762 ; 

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V., 



RA Ramaekers F.C.S., Van de Ven W.J.M.; 

RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specif ic gene and identification of its 135-kDa 

RT translational product."; 

RL J. Biol. Chem. 268:13439-13447(1993). 

RN [2] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=96429995; PubMed=88 3314 5 ; 

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K., 

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.; 

RT "Genomic organization of the human NSP gene, prototype of a novel gene 

RT family encoding reticulons . " ; 

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=95 604 66 ; 

RA Hens J., Nuydens R. , Geerts H., Senden N.H., Van de Ven W.J.M., 

RA Roebroek A. J., van de Velde H.J.K., Ramaekers F.C., Broers J.L.; 

RT "Neuronal differentiation is accompanied by NSP-C expression."; 

RL Cell Tissue Res. 292:229-237(1998). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=RTNl-A; Synonyms=NSP-A; 

CC IsoId=Q16799-l; Sequence-Displayed; 

CC Name— RTN1-B; Synonyms=NSP-B; 

CC IsoId=Q16799-2; Sequence=VSP_005644 ; 

CC Name=RTNl-C; Synonyms=NSP-C ; 

CC IsoId=Q16799-3; Sequence=VSP_005645, VSP_005646; 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES 
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTNl-C 

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION. 

CC -!- PTM: Isoforms RTN1-A and RTNl-B are phosphorylated. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L10333; AAA59950.1; -. 

DR EMBL; L10334; AAA59951.1; 

DR EMBL; L10335; AAA59952.1; -. 

DR PIR; A46583; A46583. 

DR PIR; 160904; 160904. 

DR Genew; HGNC: 104 67; RTN1. 

DR MIM; 600865; -. 

DR GO; GO:0030176; C : integral to endoplasmic reticulum membrane; TAS . 

DR GO; GO: 0004871; F: signal transducer activity; NAS . 

DR GO; GO: 0030182; P: neuron differentiation; TAS. 

DR GO; GO: 0007165; P: signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 



DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Phosphorylation. 

FT TRANSMEM 603 623 POTENTIAL. 

FT TRANSMEM 726 746 POTENTIAL. 

FT DOMAIN 589 776 RETICULON. 

FT DOMAIN 609 612 POLY-LEU. 

FT VARSPLIC 1 420 Missing (in isoform RTN1-B) . 

FT /FTId=VSP_005644. 

FT VARSPLIC 1 568 Missing (in isoform RTN1-C) . 

FT / FT I d= VS P_ 0 0 5 6 4 5 . 

FT VARSPLIC 569 588 GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW 

FT KSQ (in isoform RTN1-C) . 

FT / FT I d=VS P_0 0 5 6 4 6 . 

SQ SEQUENCE 776 AA; 83617 MW; CA5B6232353096FE CRC64; 

Query Match 13.3%; Score 787.5; DB 1; Length 776; 

Best Local Similarity 31.9%; Pred. No. 2e-23; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 588 ESEATPSPVLPDI — VMEAPLNSA VPSAGASVIQPSSSPLE ASSVNY 632 

1 I 1 I I I I : : II: I : I : : I I : I : I 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

Qy 633 — ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

I : t I : : : I I : : I : I I : : I I I : : : | 

Db 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDI S I KPEGVREPDK-PAPVEGKI I KDHLLEEST 259 

Qy 676 EAPYISIACDL IKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

I I I I II : I I II:: : : | : : : 

Db 260 FAPYID DLSEEQRRAPQITTPVKITLTEIEPSVE TTTQEKTPEKQDICLK 309 

Qy 723 SSPDSEPVDLFS DDS IPDVPQKQ DETVMLVKESLTETSF 7 61 

I I I : I I III I : I | I : : M : 
Db 310 PSPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 

Qy 762 ESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELS 821 

: I I I : : I III I I : : : 

Db 364 -KGLSYETAENPRPVGQLADRP EVKARSGPPTIPSPLDHEA 403 

Qy 822 TAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFP TLISSKTDSFS 868 

: : I I | II : : I I : I II 

Db 404 SSAESGD SEIELVSEDPMAAEDALPSGYVSFGHVGGPPPSP 444 

Qy 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD 921 

: I I : I : : I : : I : I I I III 

Db 445 ASPSIQYSILREEREAELDSELIIESCDASSAS EESPKREQDSPPMKPSALD 496 

Qy 922 DF SKNGSATSKVLL LPPDVSALGHTQAEIESIVKP 956 

I : I I I III II I I : : I 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

Qy 957 KVLEKEAEKKLPSDTEKEDRSPSA IFSADLGKTSWDLLYWRDIK 1001 

: I : : : I I : I . : I II : I I I I I I I II 
Db 552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAI DLLYWRDI K 599 



Qy 1002 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1061 

: I I : I I I : I II II I I I : I II I I : I I I II II I 1 I I I I I : II : II : I I I I I I : I 
Db 600 QTGIVFGSFLLLLFSLTQFSWSVVAYLALAALSATISFRI YKSVLQAVQKTDEGHPFKA 659 

Qy 1062 YLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1121 

III I : : I : I : I I I :: : I I I : I I I I I I I I I I I II I I I I I I I I I : I I I I I II 

Db 660 YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 

Qy 1122 NGLTLLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I I I I : : I : : I : I : : I I : I : I II I I I I I I I : : : I I I I 1 I I I I II II 
Db 720 NGLT LLLMAVVSMFTLPWYVKHQAQ I DQ YLGLVRTH INAWAKI QAKI P GAKRHAE 776 



RESULT 6 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Qiang H., Yuan J., Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homolog."; 

RL Submitted (OCT-1999) to the EMBL/GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Eye; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 



CC reticulum (Potential) . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 



CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is m no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 



DR EMBL; AF195940; AAG31360.1; 

DR EMBL; BC014697; AAH14697.1; 

DR MGD; MGI: 1339970; Rtn3 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 8 9 POTENTIAL. 

FT TRANSMEM 167 187 POTENTIAL. 

FT DOMAIN 4 9 237 RETICULON. 

SQ SEQUENCE 237 AA; 25428 MW; EB 6 0A0A7 AC 4 5 FODE CRC64; 

Query Match 10.6%; Score 626; DB 1; Length 237; 

Best Local Similarity 53.2%; Pred. No. 6.9e-18; . c r a. 

Matches 125; Conservative 41; Mismatches 53; Indeis lb; Gaps 

Ov 960 EKEAEKKLPS-DTEKEDRSPSAIFSAD LGKTS WDLLYWRDIKKTG 1004 

| | : I! : Mi: Ml I H=HM;JJ'» 

Db 3 ESSAATQSPSVSSS^ 



5SSGAEPSALGGGGGSPGACPALGAKSCGSSCAVHDLIFWRDVKKTG 62 



Ov 1005 VVFGASLFLLLSLTVFSIVS VTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1064 

UY ,|| :| :|||| = 1 = I I I I M i I I I I : I I I I I I = I I I : I I I M : I I I = 

Db 63 FVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRVYKSVIQAVQKSEEGHPFKAYLD 122 

Ov 1065 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1124 

WY •■•II I I : I : I I I : I : I I I I I : I I I I I M I I I I : I M I I = M I : 

Db 123 VDITLSSEAFHNYMNAAMVHVNKALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGI 182 

Ov 1125 T LLI LALI S LFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGL- KRKAE 1178 

Y MINI : :M I I:: M::: M Ml: I: I I : 11111 = 11= Mill 

Db 183 TLLI LAELLVFS VP I WEKYKTQI DH WGI ARDQTKS IVEKIQAKLPGIAKKKAE 237 



RESULT 7 
RTN3 HUMAN 
ID 
AC 
DT 
DT 
DT 
DE 
DE 
GN 
OS 



OC 
OC 



PRT; 236 AA. 



RTN3_HUMAN STANDARD; 
095197; 

16-OCT-2001 (Rel. 
16-OCT-2001 (Rel. 
10-OCT-2003 (Rel. 
Reticulon protein 
protein II) (NSPLII) . 
RTN3 OR NSPL2. 

Homo sapiens (Human) . . 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



40, Created) 

40, Last sequence update) 
42, Last annotation update) 

3 (Neuroendocrine-specif ic protein-like 2) 



(NSP-like 



OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=Retina; 

RX MEDLINE=99265974; PubMed=l 0331947 ; 

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.; 

RT "Cloning of a novel member of the reticulon gene family (RTN3 )• : gene 

RT structure and chromosomal localization to llql3." ; 

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Du G., Yuan J., Qiang B.; 

RT "Cloning and expression analysis of a cDNA encoding a novel 

RT neuroendocrine-specif ic protein-like protein 1: NSPL1." ; 

RL Submitted (JAN-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J. , Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J. , Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF059524; AAC99319.1; -. 

DR EMBL; AF059529; AAD20951.1; -. 

DR EMBL; AF059525; AAD20951.1; JOINED. 



DR EMBL; AF059526; AAD20951.1; JOINED. 

DR EMBL; AF059527; AAD20951.1; JOINED. 

DR EMBL; AF059528; AAD20951.1; JOINED. 

DR EMBL; AF119297; AAD26810.1 

DR EMBL; BC000634; AAH00634.1 

DR EMBL; BC010556; AAH10556.1 

DR EMBL; BC011394; AAH11394.1 

DR EMBL; BC022993; AAH22993.1 

DR Genew; HGNC: 104 69; RTN3 . 

DR MIM; 604249; -. 

DR GO; GO: 0005615; C : extracellular space; TAS . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 8 8 POTENTIAL. 

FT TRANSMEM 177 197 POTENTIAL. 

FT DOMAIN 48 236 RETICULON. 

SQ SEQUENCE 236 AA; 25609 MW; DDC 6A4 5 4 4 ABCDFB7 CRC64; 



Query Match 10.6%; Score 625.5; DB 1; Length 236; 

Best Local Similarity 54.8%; Pred. No. 7.1e-18; 

Matches 119; Conservative 41; Mismatches 56; Indels 1; Gaps 



1; 



Qy 



Db 



963 AEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI 1022 

II I I : : : | M : : | M : I I I I I I I : I : I I I I I I : 

2 0 AEPSAPGGGGSPGACPALGTKSCSSSCAVHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSV 79 



Qy 

Db 

Qy 

Db 

Qy 

Db 



1023 VSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSAL 1082 
: I I : I : I I I I I I I I I I I I I I I I I I : I I I : I I I I I : I I I : : : : I I I I : I : 

80 ISWSYLILALLSWISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 139 

1083 GHWCTIKELRRLFLVDDLVDSLKFAVIJyiWFTWGALFNGLTLLILALISLFSVPVIYE 1142 

I : I : I : I I I I I : I I I I I I I II I I : I I I I I : I I I : I I I I I I : : I I I I :: I I 
140 VHINRALKLI I RLFLVEDLVDS LKLAVFMWLMT YVGAVFNGITLLI LAELLI FS VP I VYE 199 

1143 RHQAQIDHYLGLANKNVKDAMAKIQAKIPGL-KRKAE 117 8 
: : : I I I I I : I : I I : I I I I I : I I : I : I I I 

200 KYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 236 



RESULT 8 
RTN2_HUMAN 

ID RTN2__HUMAN STANDARD; PRT; 545 AA. 

AC 075298; 060509; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 

DE protein 1) (NSPLI) . 

GN RTN2 OR NSPLl. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBIJTaxID=9606; 
RN [1] 

RP SEQUENCE FROM N . A. , ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY. 



RC TISSUE=Lung carcinoma; 

RX MEDLINE=98360096; PubMed=9693037 ; 

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.; 

RT "cDNA cloning, genomic organization, and expression of the human RTN2 

RT gene, a member of a gene family encoding reticulons . " ; 

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N . A. (ISOFORM RTN2-B) . 

RC TISSUE-Brain; 

RX MEDLINE=98191726; PubMed=9530622 ; 

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTN2-A; 

CC Isold=075298-1; Sequence=Di splayed; 

CC Note=Isoform RTN2-C is produced by alternative initiation at 

CC Met-341 of isoform RTN2-A; 

CC Name=RTN2-B; 

CC IsoId=075298-2; Sequence=VSP_005649; 

CC Event=Alternative initiation; 

CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced 

CC by alternative initiation at Met-1 and Met-341; 

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; 

DR EMBL; AF038540; AAC14910.1; -. 

DR Genew; HGNC: 10468; RTN2 . 

DR MIM; 603183; -. 

DR GO; GO: 0030176; C: integral to endoplasmic reticulum membrane; NAS. 

DR GO; GO: 0004871; F: signal transducer activity; NAS. 

DR GO; GO: 0007165; P: signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Alternative initiation. 

FT CHAIN 1 545 RETICULON PROTEIN 2, ISOFORM RTN2 -A. 

FT CHAIN 341 545 RETICULON PROTEIN 2, ISOFORM RTN2-C. 

FT INIT_MET 341 341 FOR ISOFORM RTN2-C. 

FT TRANSMEM 368 38 8 POTENTIAL. 



FT TRANSMEM 4 63 4 83 POTENTIAL. 

FT DOMAIN 345 545 RETICULON. 

FT VARSPLIC 272 344 Missing (in isoform RTN2-B) . 

FT /FTId=VSP_005649. 

SQ SEQUENCE 545 AA; 59263 MW; 971FD2F909E1E9E6 CRC64; 

Query Match 8.7%; Score 514; DB 1; Length 545; 

Best Local Similarity 28.2%; Pred. No. 3.7e-13; 

Matches 169; Conservative 81; Mismatches 186; Indels 164; Gaps 18; 

Qy 676 EAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPD-SEPVDL-F 733 

III III I I I : I : : : I I : I I I : I I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEEDEE — ETTSQDWGTPRELTF 64 

Qy 734 S DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

I II | : | : | : : : | : | | | 

Db 65 SYIAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESIPSLSQSPEPGRR 124 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

II II I I | : |::: | :::: | 

Db 125 GDP DTAPPSE RPLEDLRLRLDHLG WVARGTGSGE 158 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

: I I : I : I I I I : I I I I II 
Db 159 DSSTSSSTPLE — DEEP QEPNRLETGEAGE EL 18 8 

Qy 901 PHDLSLKNIQPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGHT 946 

I I I : I I I : : I I : : : : : I I : I 

Db 189 — DLRLRLAQPSSPEVLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPL 246 

Qy 947 QAE IESIVKPKVLEKEAE KKLP 968 

: I : | | : | :: | | | : | 

Db 247 E RE P VRGQC L D S T DQ L E FT VE P RL L GT AMEW L KT S L LLAVYKT VP I L E L S P P LWT AI GWV 306 

Qy 969 SDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASL 1011 

: : : II I I : I : I I I I I I : I : : I I I I : 

Db 3 07 QRGPT P PT PVLRVLLKWAKS PRS S GVP S LS LGADMG- S KVADLL YWKDTRT S GWFTGLM 365 

Qy 1012 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 1071 

III I I I I II I : : I I II I I I I : I : I : II : : I : I I : I I I : : : : : 
Db 366 VSLLCLLHFSIVSVAAHL7VLLLLCGTISLRVYRKVLQAVHRGDGANPFQAYLDVDLTLTR 425 

Qy 1072 ELVQKYSNS7VLGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILAL 1131 

I : : I : I : I I I I I : I I I I I I I I : I : : I : II I : I I II I I I I I : 

Db 42 6 EQTERLSHQITSRWSAATQLRHFFLVEDLVDSLKLALLFYILTEVGAIFNGLTLLILGV 4 85 

Qy 1132 ISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGL KRKAE 1178 

I I I : : I : : I : I I I I I I I : I I : I I I : I I I I I I I I I 

Db 486 I GLFT I PLLYRQHQAQI DQYVGLVTNQLSHI KAKI RAKI PGTGALASAAAAVS GS KAKAE 545 



RESULT 9 
RTN2_MOUSE 

ID RTN2_MOUSE STANDARD; PRT; 471 AA. 

AC 070622; O70620; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 



DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 

DE protein 1) (NSPLI) . 

GN RTN2 OR NSPLI. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle; 

RX MEDLINE-98191726; PubMed=9530622 ; 

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Retina; 

RX MEDLINE-22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki. S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C, Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A. , Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A. , Young A.C, Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C, Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15, 000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(20 02). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound. Endoplasmic reticulum 

CC (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; Synonyms=Brain; 

CC I sold=07 0622-1; Sequence^Displayed; 

CC Name=2; Synonyms=Muscle; 

CC IsoId=070622-2; Sequence=VSP_005650, VSP_005651; 

CC -!- TISSUE SPECIFICITY: Expressed predominantly in neural and muscular 
CC tissues. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF038537; AAC14906.1; 

DR EMBL; AF038537; AAC14907.1; -. 

DR EMBL; AF038538; AAC14908.1; -. 

DR EMBL; AF038539; AAC14909.1; 

DR EMBL; AF093624; AAD13195.1; -. 

DR EMBL; BC031370; AAH31370.1; -. 

DR MGD; MGI: 107612; Rtn2 . 



DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 295 315 POTENTIAL. 

FT DOMAIN 272 471 RETICULON. 

FT VARSPLIC 1 267 Missing (in isoform 2). 

FT /FTId=VSP_005650. 

FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2). 

FT /FTId=VSP_005651. 

SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 



Query Match 8.4%; Score 500; DB 1; Length 471; 

Best Local Similarity 29.3%; Pred. No. l.le-12; 

Matches 159; Conservative 83; Mismatches 177; Indels 124; Gaps 16; 

Qy 676 EAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPD-SEPVDL-F 733 

Ml III I I I : I : : I I I : I I I : I I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEDEEE — ETTSQDWGTPRELTF 64 

Qy 734 S DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

I | | : | : | : | : : : : : | | | 

Db 65 SYIAFDGWGSGGRRDSWRRPRPQGRSVSEPRDPPQQSGLGDSLESIPSLSQSPEPGRR 124 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

II I : I I |:: |::::| :: : I I 

Db 125 GDP DPVPPAE RPLEELRLRLDQLG WWRSAGSGE 158 

Qy 841 TETFSDSSPIEIIDEFPTLISS KTDSFSKLARE-YTDLEV SHKSEIAN 887 

I I : I : I : I I : : : I : : I I : : I I I I : I : 

Db 159 DSATSSSTPLE — NEEPDGLEASEAGEETNLELRLAQSLHLQLEVLTPQLSPSSGTPQAH 216 

Qy 888 APDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ 947 

I I I I I I : I I I : : | | : : 

Db 217 TPSPQRSQDSNSGPDDEPLLNV VEEH WRLLEQEPITAQCLDST 259 

Qy 948 AEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWF 1007 

: I ::: I : I I I I I I I : I : : I I I 

Db 260 DQSEFMLEPLLL VADLLYWKDTRTSGAVF 288 

Qy 1008 GASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEV 1067 

: III I I I I I I I :: 1 I I I I I I : I : I : I I : : I : | | : | | | : : : 
Db 289 TGLMASLLCLLHFSIVSVAAHLALLGLCATISLRVYRKVLQAVHRGDGTNPFQAYLDMDL 348 

Qy 1068 AI S EELVQKYSNSALGHVNCT I KELRRL FLVDDLVDS LKFAVLMWVFT YVGALFNGLT LL 1127 

: : I : : I II I : I I I I I : I I II I I I I : I : : I : I I I : I II I I I : 

Db 34 9 TLTREQTERLSQQIASHVVSTATQLRHFFLVEDLVDSLKLALLFYILTFVGAI FNGLTLV 4 08 



Qy 1128 ILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGL KR 1175 

I I : : : I I : I I : : I : I I I I I I I : I I : I I I : I I I I I I 

Db 4 09 ILGWALFTVPLLYRQHQAQIDQYVGLVTNQLSHIKAKIRAKIPGTGTLAPTASVSGSKA 4 68 

Qy 1176 KAE 1178 

I I I 

Db 469 KAE 471 

RESULT 10 
CPN DROME 



ID CPN_DROME STANDARD; PRT; 865 AA. 

AC Q02910; 

DT 01-OCT-1993 (Rel. 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Calphotin. 

GN CPN OR CAP. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-Canton-S ; 

RX MEDLINE=93165729; PubMed=8 094 559 ; 

RA Martin J.H., Benzer S., Rudnicka M. , Miller C.A.; 

RT "Calphotin: a Drosophila photoreceptor cell calcium-binding protein."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1531-1535(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165730; PubMed=8434 015 ; 

RA Ballinger D.G., Xue N . , Harshman K.D.; 

RT "A Drosophila photoreceptor cell-specific protein, calphotin, binds 

RT calcium and contains a leucine zipper."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1536-1540(1993). 

CC -!- FUNCTION: Might function as a calcium-sequestering "sponge" to 
CC regulate the amount of free cytoplasmic calcium. It binds 0.3 mole 

CC of Ca(2+) per mole of protein. 

CC -!- SUBUNIT: Homodimer (Probable). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic; hypodense compartment. 

CC -!- TISSUE SPECIFICITY: Soma and axons of photoreceptor cells of 

CC compound eyes and ocelli. 

CC -!- DEVELOPMENTAL STAGE: Expressed early in photoreceptor cell 
CC development. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 



DR 


EMBL; L02111; AAA2 


8405. 1; 


-. 


DR 


EMBL; L05080; AAA2 


8420.1; 


— . 


DR 


PIR; A472 


82; A47282. 




DR 


PIR; A472 


83; A47283. 




DR 


FlyBase; 


FBgn0010218; Cpn, 




DR 


GO; GO:0005509; F: 


calcium 


ion binding; IDA. 


KW 


Calcium-binding . 






FT 




•3 D 


36 


A -> AVAPAWA (IN REF. 2) . 


FT 


CONFLICT 


43 


43 


I -> T (IN REF. 2) . 


FT 


CONFLICT 


64 




I -> V (IN REF. 2) . 


FT 


CONFLICT 


76 


76 


T -> A (IN REF. 2) . 


FT 


CONFLICT 


100 


100 


P -> PP (IN REF. 2) . 


FT 


CONFLICT 


126 


127 


VQ -> AP (IN REF. 2) . 


FT 


CONFLICT 


154 


154 


I -> V (IN REF. 2) . 


£ 1 


CONFLICT 


160 


160 


C* m / TUT T*\ TP TT* O \ 

o -> T (IN REF. Z) . 


FT 


CONFLICT 


534 


534 


A -> E (IN REF. 2) . 


FT 


CONFLICT 


699 


699 


I -> T (IN REF. 2) . 


FT 


CONFLICT 


703 


703 


V -> L (IN REF. 2) . 


FT 


CONFLICT 


721 


721 


D -> E (IN REF. 2) . 


SQ 


SEQUENCE 


865 AA; 


84781 


MW; 2110417EOB0E7CFE CRC64; 


Query Match 




5.53 


;; Score 324.5; DB 1; Length 



Best Local Similarity 22.3%; Pred. No. l.le-05; 
Matches 229; Conservative 128; Mismatches 375; Indels 297; Gaps 49 

Qy 59 PAAGL S AAP VPTAPAAGAP LMD FGND FVP PAP RGP L PAAP P VAPERQ P S WD P S P VS S T VP 118 

I : Mill III: I I I I I : I I I | : | | : | 

Db 9 PVSAPVT^APV-TPSAVAAPVQWSPAAVAPAPAAPIAVTPVAPPPTLASVQPATV — TIP 65 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAP APAAPP — ST 172 

I I : I : : I I : I : I , I I I I I : I I : I I I I I I II : I 

Db 66 APAPIAAASVTP VASVAPPWAAPTPPAA-SPVSTPVAVAQIPVAVSAPVAPPVAAT 121 

QY 173 PAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASF 232 

I : : I : I | | | : II I :||: |: 

Db 122 PTPWQIPVAAPVIAT PPVAASA PT PAAVTPVISPVIAS — 160 

Qy 233 KEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSV 2 92 

1:1 I I I: : I :: I :| 
Db 161 PPWPANTT VPVAAPVAAVPAAVPWAPVLAP AV 194 

Qy 293 SPKAESAVIVAN PREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS 34 7 

: I I : I I I I I I : I : | : I I I : 

Db 195 APAV — APWAET PAP P PVAE I P VAT IPECVAPLI PEVSWA 234 

Qy 34 8 SEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVD 407 

: I : I I I I I | :: : I : : : I 

Db 235 T KPLAAAEPVWAPPATET PWAPAAAS PHVS VAPAVETAWAPVS 280 

Qy 408 KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFP 467 

II : : I : I I I I I 
Db 281 ASTEPPV AAATLTTAPETPAL 301 

Qy 4 68 LLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

I : I I III II 
Db 302 APWAESQ VAA NTWATPP 320 



Qy 528 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA 57 9 

I I I : I : I : I II : | I I : I I I I : I I 

Db 321 TPAPEPETIAPPW AET PEVAS VAVA- ETTP P W — PPVAAESI-PAPWATTPV 371 

Qy 580 -AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSS PLEAS SVNYESIKHE 638 

I I : : I: I II :: :|: Ml : I |: I I : 

Db 372 PATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVLPPVAAEPVPAWAEET 431 

Qy 639 PENPPPYEEAMSV-SLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPA 697 

II I I ::::|:: I I : I II : I : : | | | 

Db 432 PETPAPASAPVTIAALDIPEVAPVIAAPSDAPAEAPSAAAPIVSTPPTTASVPETTAPPA 4 91 

Qy 698 P DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPVDLFSDDSIP — DVPQK 744 

11111:111 I : I I : I I I : I : | 

Db 492 AVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDL 550 

Qy 745 QDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEV 804 

: : I : I : : : I : : I I I : : : : | 

Db 551 LEQTTSVPAVEAAESTSSPIPE TSLPPPNEAVASPEVAVAPITAPEPIPEPEP 603 

Qy 805 STLSKKEKIPLQ MEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLI 860 

I : I I I : : : : I II I : I I I III III: 
Db 604 S LAT P T E P I P VEAP WI Q EAVDAV EVPVTETST SIP-ETTVEFPEAV 649 

Qy 861 SSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP HDLSL 906 

: I I | : | : : : : | | | M III: I : : : 

Db 650 AEKV LDPAITEAPVTTQEPDVANINDGA PAT E I T T PAVE I VT AAAE VS D I AI 701 

Qy 907 KNIQPKVEEKISFSDDFSKNGSATSKVLL LP PDVSALG 944 

I I I : : I : : : : : : : I : : : I I : I I 

Db 702 PVIDPPVPQEIAVA-EIPETDTKPAEVIVEQSTIPIEAPVPEVSKYAEPVISEAPAAEVP 760 

Qy 945 HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLY 996 

: I I : I I : I I I : I : :: III I : I : I I 
Db 761 ITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD — SVPVAK — ITPLL- 814 

Qy 997 WRDIKKTGV 1005 

I I : : I I 
Db 815 -RDLQTTDV 822 



RESULT 11 
PCLO_HUMAN 

ID PCLO_HUMAN STANDARD; PRT; 5147 AA. 

AC Q9Y6V0; 043373; 060305; Q9BVC8; Q9UIV2; Q9Y6U9; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 
DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Piccolo protein (Aczonin) (Fragments) . 

GN PCLO OR ACZ OR KIAA0559. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE OF 1-759 FROM N.A. 

RC TISSUE=Brain; 



RX MEDLINE=99439764; PubMed=10508 862 ; 

RA Wang X., Kibschull M. , Laue M.M. , Lichte B., Petrasch-Parwez E., 

RA Kilimann M.W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP SEQUENCE OF 552-4404 FROM N.A. 

RA Kraemer J., Wollam C, Wohldmann P., McGrane B.; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 3619-5147 FROM N.A. (ISOFORM 2) . 

RC TISSUE=Brain; 

RX MEDLINE=98290545; PubMed=9628581 ; 

RA Nagase T. , Ishikawa K.-I., Miyajima N., Tanaka A., Kotani H., 

RA Nomura N. , Ohara O.; 

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 5:31-39(1998). 

RN [4] 

RP SEQUENCE OF 4405-4439 FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE-22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM,, Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L., Marusina K . r Farmer A.A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S. f Sanchez A. , 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 4405-5147 FROM N.A. 

RA Kalicki J., Elliott G. ; 

RL Submitted (FEB-1998) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 
C C organization of synaptic active zones and in synaptic vesicle 

CC trafficking (By similarity) . 

CC -!- SUBUNIT: Interacts with Rabacl/Pral, RIMS2 and profilin (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 
CC synaptic junctions (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 



CC Event=Alternative splicing; Named isoforms=2; 

CC Comment=Additional isoforms seem to exist; 

CC Name=l; 

CC IsoId=Q9Y6V0-l; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q9Y6V0-2; Sequence=VSP_0 03923, VSP_003924, VSP_003925, 

CC VSP_003926, VSP_003927; 

CC Note=No experimental confirmation available; 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y19188; CAB60727.1; -. 

DR EMBL; AC004903; AAD20936.1; -. 

DR EMBL; AC004886; AAD21789.1; -. 

DR EMBL; AB011131; BAA25485.1; -. 

DR EMBL; BC001304; AAH01304.1; 

DR EMBL; AC004082; AAB97937.1; -. 

DR PIR; T00634; T00634. 

DR HSSP; P04410; 1A25. 

DR Genew; HGNC: 13406; PCLO. 

DR MIM; 604918; -. 

DR GO; GO: 00 058 56; C : cytoskeleton; NAS . 

DR GO; GO: 0045202; C: synaptic junction; ISS. 

DR GO; GO: 0005509; F: calcium ion binding; ISS. 

DR GO; GO:0005544; F: calcium-dependent phospholipid binding; ISS. 

DR GO; GO: 0005522; F:profilin binding; ISS . 

DR GO; GO:0007010; P : cytoskeleton organization and biogenesis; ISS. 

DR GO; GO: 0016080; P: synaptic vesicle targeting; ISS. 

DR InterPro; IPR000008; C2 . 

DR InterPro; IPR001565; Synaptotagmin . 

DR PRINTS; PR00360; C2 DOMAIN . 

DR PRINTS; PRO 0399; SYNAPTOTAGMN . 

DR SMART; SM00239; C2 ; 2. 

DR PROSITE; PS004 99; C2_DOMAIN_l; 1. 

DR PROSITE; PS50004; C2_DOMAIN_2 ; 2. 

KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger; 

KW Repeat; Alternative splicing. 

FT NON_TER 1 1 

FT DOMAIN 400 465 10 X 10 AA TANDEM APPROXIMATE REPEATS OF 

FT P-A-K-P-Q-P-Q-Q-P-X . 

FT ZN_FING 4 99 523 C4-TYPE (POTENTIAL) . 

FT ZN_FING 969 992 C4-TYPE (POTENTIAL) . 

FT NON_CONS 1010 1011 

FT DOMAIN 2300 2325 POLY-PRO. 

FT DOMAIN 4391 4442 PDZ . 

FT DOMAIN 4544 4633 C2 DOMAIN 1. 



FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



DOMAIN 
VARSPLIC 



VARSPLIC 
VARSPLIC 
VARSPLIC 
VARSPLIC 
SEQUENCE 



5031 5121 C2 DOMAIN 2. 

4404 4404 S -> SGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAE 

QTGKLMEG (in isoform 2). 

/FTId=VSP_003923. 
4534 4534 K -> KPTDGTKWSHPITGEIQ (in isoform 2) . 

/FTId=VSP_003924 . 
4576 4576 G -> GQVMWQNAS (in isoform 2). 

/FTId=VSP_003925. 
4757 4761 TAHKS -> SKRRK (in isoform 2) . 

/FTId=VSP_003926. 
4762 5147 Missing (in isoform 2) . 

/FTId=VSP_003927 . 
5147 AA; 563537 MW; CD5D84 990498CD3C CRC64 ; 



Query Match 5.4%; Score 320; DB 1; Length 5147; 

Best Local Similarity 21.8%; Pred. No. 0.00015; 

Matches 260; Conservative 152; Mismatches 440; Indels 340; Gaps 61; 

Qy 8 PLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAP 67 

I : I I I I I :: | | : : Mil 
Db 260 PSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPG PAKPPAQPSGLT 303 

Qy 68 VPTAPAAGA P LMD FGND FVP PAP RG P L PAAP P V 100 

II I I : I I I I I I I I 

Db 304 KPLAQQPGTVKPPVQPPGTTKPPAQPLG — PAKPPAQQTGSEKPSSEQPGPKALAQPPGV 361 

Qy 101 — APERQPSWDPSPVSSTVPAPSPLS AAAVSPSKLPEDDEPPA RPP 144 

I : I I I : I I I I : I I I : I I : I I : : I I 

Db 362 GKT PAQQPG-PAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPAQTKPPSQQPGSTKPP 42 0 

Qy 145 P — PPPASVSPQAEPVWTPPAPAP-AAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGN 2 01 

I I II III I I : I : I I I : I I 

Db 421 PQQPGPAKPSPQQPGSTKPPSQQPGSAKPSA QQPSP 456 



Qy 202 TISAGQEDFPSVLLETAASXPSLSPLSAAS FKEHEYLGNLSTVLPTEGTLQENVSEASKE 261 

: I : I : : I I I : I I : I hi I : I 

Db 457 AKPSAQQ-FTKPVSQTGFGKPLQPPTVSPSAKQPPSQGLPKTICPL CNTTELLLH 510 

Qy 2 62 VSEKA KTLLI DRDLTEFSE LEYSEMGSSFSVSPKA 2 96 

I I II : I : : I I I I : I : I : 

Db 511 VPEKANFNTCTECQTTVCSLCGFNPNPHLTEAKEWLCLNCQMKRALGGDLAPVPSSPQPK 57 0 

Qy 297 ESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQEL PTALTKLVKED 343 

I I I : : I : : II I I : I I III: 

Db 571 LKTAPVTTT SAVS KS S PQPQQTS PKKDAAPK QDLSKAPEPKKPPPLVKQP 620 

Qy 344 EWSSEKAK DSFNEKRVAVEAPMREE YADFKPFERVWEVKDSKEDSDM 391 

: I I I II:: I : I 1 I I I : | | : | : 

Db 621 TLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKP KQPKMVKPTTDL 67 6 

Qy 392 LAAGGKIESNLESKVD KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYI 448 

: : : I : : I I I : : I : I : : | : Mill: 
Db 677 VSS SSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKV 723 



Qy 449 T CAP FN PAATESIATNIFPLLE DPTSENXTDEKKIEEKKAQIVTEKN 495 

M : II: | :: | I | | : II I II I I 

Db 724 — TPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPV QKKEEPKKAQTKMS PK 775 



Qy 496 TSTK TSNPFFVAAQDSETDYVTTD NLTKVTEEWANMPEGLT 537 

I I I I I : I I : I : I : I 
Db 776 PDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITD APKSQPTT 832 

Qy 538 PDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA--AQLCPSFEESEATPSP 595 

I II III : : II:: : I : I I : I I : I 

Db 833 P QET VTGKLFGFGASI-FSQASNLI STAGQPGPHSQSGPGAPMKQA-PAP 880 

Qy 596 VLPDIVMEAPLNSA-VPSAGASVI QPSSS PLEAS SVNYES IK — HEPENPPPY 645 

I I : : I I I I | : : | | : : : | : | | | 

Db 881 SQPPTSQGPPKSTGQAP PAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPI 940 

Qy 646 EEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIK-ETKLSAEPAPDFSDYS 704 



Db 941 KDSKSLT AEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCT 990 

Qy 705 EMAKVE QPVPDHSELVE DSSPDSEPVDLFSDD5 737 

I I I : I : | | : | : : :: | 

Db 991 ECKNQVCNLCGFNPTPHLTENCQTQRAISGQLGDIRKMPPAPSGPKAS PMPVPTESSSQK 1050 

Qy 738 — IPDVPQ KQD ETVML--VKESLTETSFESMIEYENKEKLSALPPEGG 781 

: I II II: I I : I I I I : I : I : : I : : I I : 

Db 1051 TAVP — PQVKLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEESKLEKDKA 1108 

Qy 782 KPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET 841 

I I I I I : I : I : : I I I I I : I : I : I I : 

Db 1109 SALQE — KKPLPEEK-KLIPEE-EKIRSEEKKPLLEEKKPTP EDKKLLPEAKTSAP 1160 

Qy 842 ETFSD SSPIEIIDE-FPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA-GSLP 896 

I I : : I : I : : I I I : : : : II III 

Db 1161 EEQKHDLLKSQVQIAEEKLEGRVAPKTVQEGK QPQTKMEGLPSGTPQSLP 1210 

Qy 897 CTELPHDLSLKNI QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ 947 

: I : I I II I : I: :::|: I :| |:: 

Db 1211 KED DKTTKTIKEQPQPPCTAKPDQEKEDDKSDTSSSQQPKSPQGLSDTGYSSDGISS 1267 

Qy 948 --AEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGK — TSWDLL 995 

III:: II: II I : : : I I I : : I I I : : I : : I 

Db 1268 SLGEIPSLIPTD— EKDILKGLKKDSFSQESSPSS— PSDLAKLESTVLSIL 1315 

RESULT 12 
PCLO MOUSE 



ID PCLO_MOUSE STANDARD; PRT; 5038 AA. 

AC Q9QYX7; Q9QYX6; Q9QZJ0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Piccolo protein (Presynaptic cytomatrix protein) (Aczonin) (Brain- 

DE derived HLMN protein) . 

GN PCLO OR ACZ. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID-10090; 

RN [1] 



RP SEQUENCE FROM N.A. , SUBCELLULAR LOCATION, ALTERNATIVE SPLICING, 

RP TISSUE SPECIFICITY, AND INTERACTION WITH PROFILIN. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=l 0508 862 ; 

RA Wang X., Kibschull M., Laue M.M. , Lichte B., Petrasch-Parwez E., 

RA Kilimann M.W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP REVISIONS. 

RA Kilimann M.W. ; 

RL Submitted (AUG-2001) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 4502-4682 FROM N.A. 

RC TISSUE=Brain; 

RA Huang W. , Jin W., Huang C, Chen B., Zhang J., Ju G. ; 

RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP INTERACTION WITH RIMS2 . 

RX MEDLINE=22384373; PubMed=124 0 17 93 ; 

RA Fujimoto K. , Shibasaki T., Yokoi N., Kashima Y . , Matsumoto M. , 

RA Sasaki T., Tajima N., Iwanaga T., Seino S.; 

RT "Piccolo, a Ca2+ sensor in pancreatic beta-cells. Involvement of 

RT cAMP-GEFII . Rim2 . Piccolo complex in cAMP-dependent exocytosis . " ; 

RL J. Biol. Chem. 277:50497-50502(2002). 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 
CC organization of synaptic active zones and in synaptic vesicle 

CC trafficking. 

CC -!- SUBUNIT: Interacts with Rabacl/Pral, RIMS2 and profilin. 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 

CC synaptic junctions. 

CC ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isof orms=2 ; 

CC Name=l; 

CC IsoId=Q9QYX7-l ; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q9QYX7-2; Sequence=VSP_0 0392 8 , VSP_003929; 

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Low levels found in 
CC stomach. Not detected in other tissues analyzed including adrenal 

CC gland, testis and pancreas . 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf orraatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y19185; CAB60731.2; 



DR EMBL; Y19186; CAB60732.2; 

DR EMBL; AF181269; AAD55786.2; -. 

DR HSSP; P04410; 1A25. 

DR MGD; MGI : 1349390; Pclo. 

DR GO; GO: 0045202; C: synaptic junction; IDA. 

DR GO; GO: 0005509; F: calcium ion binding; ISS. 

DR GO; GO: 0005544; F: calcium-dependent phospholipid binding; ISS. 

DR GO; GO:0005522; F:profilin binding; IDA. 

DR GO; GO: 0019933; P : cAMP-mediated signaling; IDA. 

DR GO; GO: 0007010; P : cytoskeleton organization and biogenesis; IDA. 

DR GO; GO: 0030073; P:insulin secretion; IDA. 

DR GO; GO: 0017157; P: regulation of exocytosis; IDA. 

DR GO; GO: 0016080; P: synaptic vesicle targeting; NAS . 

DR InterPro; IPR000008; C2 . 

DR InterPro; IPR001478; PDZ . 

DR InterPro; IPR008899; Znf_piccolo. 

DR Pfam; PF00168; C2 ; 2. 

DR Pfam; PF00595; PDZ; 1. 

DR Pfam; PF05715; Zf_piccolo; 2. 

DR SMART; SM00239; C2 ; 2. 

DR SMART; SM00228; PDZ; 1. 

DR PROSITE; PS00499; C2_DOMAIN_l; 1. 

DR PROSITE; PS50004; C2_DOMAIN_2 ; 2. 

DR PROSITE; PS50106; PDZ ; 1. 

KW Calcium/phospholipid-binding; Metal-binding; Zinc; Zinc-finger; 

KW Repeat; Alternative splicing. 



FT 


DOMAIN 


371 


470 


10 X 10 AA TANDEM APPROXIMATE 


FT 








P-A-K-P-Q-P-Q-Q-P-X . 


FT 


ZN_FING 


502 


526 


C4-TYPE (POTENTIAL) . 


FT 


ZN_FING 


967 


990 


C4-TYPE (POTENTIAL) . 


FT 


DOMAIN 


2305 


2329 


POLY-PRO. 


FT 


DOMAIN 


4394 


4488 


PDZ. 


FT 


DOMAIN 


4607 


4705 


C2 DOMAIN 1. 


FT 


DOMAIN 


4922 


5012 


C2 DOMAIN 2. 


FT 


VARSPLIC 


4829 


4833 


TKPTN -> SKRRK (in isoform 2) 


FT 








/FTId=VSP_003928. 


FT 


VARSPLIC 


4834 


5038 


Missing (in isoform 2) . 


FT 








/FTId=VSP_003929. 


SQ 


SEQUENCE 


5038 


AA; 547600 


MW; DADA460CF3B40888 CRC64; 



Query Match 5.3%; Score 315; DB 1; Length 5038; 

Best Local Similarity 22.0%; Pred. No. 0.00023; 

Matches 254; Conservative 136; Mismatches 435; Indels 328; 



Gaps 57; 



Qy 

Db 



57 RKPA — AGLSAAPVPTAPAAGAPLMDFG NDFVPPAPR GPLPAAPPVAPE 103 

: I I I I : I II I II: I : I I : I 

288 KSPAQPAGTGKSPAQPPVTAKPPAQQAGLEKTSLQQPGPKSLAQTPGQGKVPPGPAKSPA 347 



Qy 

Db 

Qy 

Db 



104 RQ PSWDPSP-VSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ-A 155 

: I I : I I : I I I I : I III I : I I I II 

34 8 QQPGTAKLPAQQPGPQTASKVPGPTKTPAQLSGPGKTPAQQPGPTKPSPQQPIPAKPQPQ 4 07 

156 EPVWT PPAPAPAAPP STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQED 209 

: II I I M I I I I I I I : : : I III I : 

408 QPVATKPQPQQPAPAKPQPQHPTPAKPQPQQPTPA KPQPQQPTPAKPQP 456 



Qy 



210 FPSVLLETAASXPSLS PLSA — ASFKEHEYLGNLSTVLP TEGTLQENVS 256 



Db 457 QHPGLGKPSAQQPSKSISQTVTGRPLQAPPTSAAQAPAQGLSKTICPLCNTTELLLHTPE 516 

Qy 257 EASKEVSEKAKTLLI DRDLTEFSEL EYSEMGSSFSVSPKA 296 

: I : : : : : : III I I : : I I : I I I 

Db 517 KANFNTCTECQSTVCSLCGFNPNPHLTEIKEWLCLNCQMQRALGGEL7\AIPSSPQPTPKA 576 

Qy 297 ESAVIV ANPREEIIVKN KDEEEK LVSNNILHX QQE 331 

I | : | : : | : | | | | I | | I 

Db 577 ASVQPATASKSPVPSQQASPKKELPSKQDSPKAPESKKPPPLVKQPTLHGPTPATAPQPP 636 

Qy 332 LPTALTKLV— KEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS 389 

: I I I I : I : I I I I : I I : I : : : : I I 

Db 637 VAEALPKPAPPKKPSAALPEQAK APV ADVEPKQP — KTTETLTDS 679 

Qy 390 DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYIT 44 9 

II : I I : I I : I :: I : I I I I : I 
Db 680 PSSAAATSKPAILSSQVQ AQAQVTTAPPLKTDSAKTSQSFPPTGD T 725 

Qy 450 CAP FN PAATESIATNIFPLLEDPTSENXTD-EKKIEEKKAQIVTEKNTSTKTSN 502 

I : II: I : : I M I : : I I I I I I I II 
Db 726 ITPLDSKAMPRPASDSKIVSH PGPTSESKDPVQKKEEPKKAQTKVTPKPDTK 777 

Qy 503 PFFVAAQDSETDYVTTDNLTKVTEEV VANMPEGLTPDLVQEACE 546 

I : |||::: :|: |: I I I 

Db 778 PVPKGSPTPSGTRPTTGQATPQSQQPPKPPEQSRRFSLNLGGIADAPKS-QPTTPQET — 834 

Qy 547 SELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP-- — DIVM 602 

III : : II:: III: III I : 

Db 835 VTGKLFGFGASI-FSQASNLIS TAGQQAPHPQTGPAAPSKQAPPPSQTLAA 884 

Qy 603 EAPLNSA — VPSAGASVIQ PSSSPLEASSVNYESI KHEP ENPP 643 

: I I I I I I I : : I I I : : III II 

Db 8 85 QGPPKSTGQHPSAPAKTTAVKKETKGPAAENLEAKPAQAPTVKKAEKDKKHPPGKVSKPP 94 4 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIK-ETKLSAEPAPDFSD 702 

I III I : : | | | : | : : : | : I : 

Db 945 P TEPEKAVLAQKPDKTTKPKPACPLCRTELNVGSQDPPNFNT 986 

Qy 703 YSEMAKVE QPVPDHSELVE DSSPDSEPVD 731 

: I I I : I : I I I : I I 

Db 987 CTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLGDMDKMPPASSGPKASPVP 104 6 

Qy 732 LFSD DSIPDVPQKQDETV MLVKESLT 7 57 

: : : : : I : I : : : | : : I 

Db 1047 APAEPPPQKTPTAAHAKGKKKETEVKAETEKQIPEKETPSIEKTPPAVATDQKLEESEVT 1106 

Qy 758 ETSFESMIEYENKEKLSAL PPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSK 8 09 

: : : | : | : | | I I I I I I I : I I I I : I : 

Db 1107 KSLVSVLPEKKPSEEEKALPADKKEKKPPAAEAPPLEEKKPIPDDQK — LPPDAKPSASE 1164 

Qy 810 KEKIPLQMEELSTAVYSNDDLFISKEAQIR-ETETFSDSSPIEIIDEFPTLISSKTDSFS 868 

I : : : I I : : I I I : I I I : I : : I : 
Db 1165 GEE KRDLLKAHVQIPEEGPIGKVASLACEGEQQPDTRPEDLPGATPQTLPKD 1216 

Qy 869 KLAREYTDLEVSHKSEIANAPDGAG--SLPCTELPHDLS--LKNIQPKVEEKISFSDDFS 924 

I : : I : I : I I I II II : | I I : : I : I 



Db 



1217 - 



— RQKESRDVTQPQAEGTAKEGRGEPSKDRTEKEEDKSDTSSSQQPKSPQGLS-DTGYS 1272 



Qy 925 KNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSA 984 

: I : I II III:: I I : I I I : : : I I I : : 

Db 1273 SDGISGS LG EIPSLI PSD— EKDLLKGLKKDSFSQESSPSS--PS 1313 

Qy 985 DLGK — TSWDLL 995 

III : : 1 : : I 
Db 1314 DLAKLESTVLSIL 1326 



RESULT 13 
ANK2_HUMAN 

ID ANK2_HUMAN STANDARD; PRT; 3924 AA. 

AC Q01484; Q01485; 

DT 01-APR-1993 (Rel. 25, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Ankyrin 2 (Brain ankyrin) (Ankyrin B) (Ankyrin, nonerythroid) . 

GN ANK2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RC TISSUE=Brain stem; 

RX MEDLINE=91302466; PubMed=1830053 ; 

RA Otto E., Kunimoto M. , McLaughlin T., Bennett V. ; 

RT "Isolation and characterization of cDNAs encoding human brain 

RT ankyrins reveal a family of alternatively spliced genes.'*; 

RL J. Cell Biol. 114:241-253(1991). 

RN [2] 

RP REVISIONS. 

RA Carpenter S . ; 

RL Submitted (MAY-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TISSUE=Brain stem; 

RX MEDLINE=94075409; PubMed=8253844 ; 

RA Chan W., Kordeli E., Bennett V. ; 

RT "440-kD ankyrinB: structure of the major developmentally regulated 

RT domain and selective localization in unmyelinated axons."; 

RL J. Cell Biol. 123:14 63-14 73(1993). 

RN [4] 

RP SEQUENCE OF 463-495 FROM N.A. 

RX MEDLINE=92009921; PubMed-1 833308 ; 

RA Tse W.T., Menninger J.C., Yang-Feng T.L., Francke U., Sahr K.E., 

RA Lux S.E., Ward D.C., Forget B.G.; 

RT "Isolation and chromosomal localization of a novel nonerythroid 

RT ankyrin gene."; 

RL Genomics 10:858-866(1991). 

CC -!- FUNCTION: Attach integral membrane proteins to cytoskeletal 
CC elements. Also bind to cytoskeletal proteins. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isof orms=3 ; 

CC Name=l; 



CC IsoId=Q01484-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q01484-2; Sequence=VSP_000267 , VSP_000268; 

CC Name =3 ; 

CC IsoId=Q01484-3; Sequence=VSP_000268 ; 

CC -!- TISSUE SPECIFICITY: Plasma membrane of neurons as well as glial 
CC cells throughout the brain. 

CC -!- PTM: Phosphorylated at multiple sites by different protein kinases 

CC and each phosphorylation event regulates the protein's structure 

CC and function (Potential) . 

CC -!- SIMILARITY: Contains 23 ANK repeats. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X56957; CAA40278.1; -. 

DR EMBL; X56958; CAA40279.2; 

DR EMBL; Z26634; CAB42644.1; -. 

DR EMBL; M37123; AAA62828.1; -. 

DR PIR; S37431; S37431. 

DR HSSP; P42771; 1DC2 . 

DR Genew; HGNC:4 93; ANK2 . 

DR MIM; 106410; -. 

DR InterPro; IPR002110; ANK. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00023; ank; 24. 

DR Pfam; PF00531; death; 1. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01415; ANKYRIN. 

DR SMART; SM0024 8; ANK; 22. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50088; AN K_RE P EAT ; 20. 

DR PROSITE; PS50297; ANK_REP_REGION; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; 1. 

KW Cytoskeleton; Alternative splicing; Repeat; ANK repeat; 

KW Phosphorylation. 



FT 


REPEAT 


63 


92 


ANK 


1. 


FT 


REPEAT 


96 


125 


ANK 


2. 


FT 
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496 


525 


ANK 14 
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ANK 15 
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ANK 16 




FT 


REPEAT 


595 
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ANK 17 




FT 


REPEAT 
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ANK 18 




FT 


REPEAT 


661 
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ANK 19 




FT 


REPEAT 


694 
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ANK 2 0 
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REPEAT 


727 


756 


ANK 21 




FT 


REPEAT 


760 
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ANK 22 




FT 


REPEAT 


793 
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ANK 23 




FT 


DOMAIN 


1773 


1950 


REPEAT - 


-RICH REGION. 


FT 


REPEAT 


1773 


1784 


REPEAT 


A. 


FT 


REPEAT 


1785 


1796 


REPEAT 


A. 


FT 


REPEAT 


1797 


1808 


REPEAT 


A. 


FT 


REPEAT 


1809 


1820 


REPEAT 


A. 


FT 


REPEAT 


1821 


1832 


REPEAT 


A. 


FT 


REPEAT 


1833 


1844 


REPEAT 


A. 


FT 


REPEAT 


1845 


1856 


REPEAT 


A. 


FT 


REPEAT 


1857 


1867 


REPEAT 


A (APPROXIMATE) . 


FT 


REPEAT 


1868 


1879 


REPEAT 


A. 


FT 


REPEAT 


1880 


1891 


REPEAT 


A. 


FT 


REPEAT 


1892 


1902 


REPEAT 


A (APPROXIMATE) . 


FT 


REPEAT 


1903 


1914 


REPEAT 


A. 


FT 


REPEAT 


1915 


1926 


REPEAT 


A. 


FT 


REPEAT 


1927 


1938 


REPEAT 


A. 


FT 


REPEAT 


1939 


1950 


REPEAT 


A. 


FT 


DOMAIN 


3536 


3620 


DEATH. 




FT 


VARSPLIC 


1039 


1039 


Q -> QFLGKLHLPTAPPPLNEGESLVSRILQLGPPGTK 


FT 








(in isof orm 2 ) . 


FT 








/FTId-VSP_000267. 


FT 


VARSPLIC 


1444 


3528 


Missing (in isoform 2 and isoform 3) . 


FT 








/FTId=VSP_000268 . 


FT 


CONFLICT 


475 


476 


GQ -> PE (IN REF. 4) . 


FT 


CONFLICT 


971 


971 


I -> S 


(IN REF. 1) . 


FT 


CONFLICT 


3581 


3582 


QY -> HA (IN REF. 1) . 


FT 


CONFLICT 


3586 


3586 


I -> Y 


(IN REF. 1) . 


SQ 


SEQUENCE 


3924 


AA; 430337 


MW; 52AC496C428E29D2 CRC64; 



Query Match 5.0%; Score 299; DB 1; Length 3924; 

Best Local Similarity 21.5%; Pred. No. 0.00069; 

Matches 258; Conservative 167; Mismatches 417; Indels 358; Gaps 62; 

Qy 14 DSPP RPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLE ELEVLERK 58 

IN : I I | :::::::: | | : II II 

Db 1648 DIPPDETQSTQKQHKPSLGIKKPVRRKLKEKQKQKEEGLQASAEKAELKKGSSEESLGED 1707 

Qy 59 P AAGL S AAP VP TAP AAGAP LMD FGNDFVPPAPRG 92 

I ||: I : I I I : I I : : I : I : I 

Db 17 08 P — GLAPEPLPTVKAT-SPLIEETPIGSIKDKVKALQKRVEDEQKGRSKLPIRVKGKEDV 1764 

Qy 93 PLPAA-PPVAPERQPSWDPSP VSSTVPAPSPL 123 

I I I I I : M III M M: 

Db 1765 PKKTTHRPHPAASPSLKSERHAPGSPSPKTERHSTLSSSAKTERHPPVSPSSKTEKHSPV 1824 



Qy 124 SAAA VSPSKLPEDDEPPA RPPPPPPASVSPQAEPVW 159 

|:| II II I I : I II I: : = I I 

Db 1825 SPSAKTERHSPASSSSKTEKHSPVSPSTKTERHSPVSSTKTERHPPVSPSGKTDKRPPV- 1883 



Qy 160 TPPAPAPAAPPSTPAAPKRR GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLE 216 

:| II :| ::| II I II :| :: I 

Db 1884 SPSGRTEKHPPVSPGRTEKRLPVSPSGRTD KHQPVSTAGKTEKHLPVSPSG 1934 

Qy 217 TAASXPSLSPLSAAS-FKEHEYLGNLSTVLPT EGTLQENVSEASKEVSEKAKT 268 

I : I I I : I : I : : I I : I I : I I I 

Db 1935 KTEKQPPVSPTSKTERIEETMSVRELMKAFQSGQDPSKHKTGLFEHKSAKQKQPQEKGKV 1994 

Qy 269 LLIDRDLTEFSELEYSEMGSSFSVSPKAES — AVIVANPREEI IVKNKDE — 316 

: I I : : : I : : I I : III I I : : I I I : I 

Db 1995 RVEKEKGP I LTQRE-AQKT ENQT I KRGQRL PVTGTAES KRGVRVS S IGVKKEDAAG 2049 

Qy 317 -EEKLVSNNI LHXQQELPTALTKLVKE DEW 346 

: I I : : I : I I : I : I I : I I I : 

Db 2050 GKEKVLSHKI PEPVQSVPEEESHRESEVPK — EKMADEQGDMDLQI S PDRKTSTDFSEVI 2107 

Qy 347 SSE-KAKDSFNEKRVAVEAPMREEYAD FKPFERVWEVKDSKED 388 

I : I : : I:: I : : I II : : |:: 

Db 2108 KQELEDNDKYQQFRLSEETEKAQLHLDQVLTSPFNTTFPLDYMKDEFLPALSLQSGALDG 2167 

Qy 389 -SDMLAAGGKIESNLESKVDKKCFADSLEQTNHE KDSESSNDDTSFPSTPEGIKDRS 444 

I : I I I I : : II II : I : I : I I : I : : : : : 

Db 2168 SSESLKNEGVAGSPCGSLMEGTPQISSEESYKHEGLAETPETSPESLSF — SPKKSEEQT 2225 

Qy 445 GAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPF 504 

I : II I I I : : : I I I : I : : III:: 

Db 222 6 G ETKESTKTETTTEIRSEKEHPTTKDITGGS — EERGATVTEDSETSTESFQKE 2277 

Qy 505 FVAAQDSETDYWTDNLTKVTEEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKM 563 

: I I : I : I I I II : II I I : I : : I 

Db 227 8 ATLGSPKDTSPKRQDDCTGSCSVALAKETPTGLT EEAACDEGQRTFGSS-AHKT — 2330 

Qy 564 DLVQTSEVMQESLYPAAQLCPS FEES EAT PSPVL PDIVMEAP — LNSAV 610 

II III : : I : : I I I I |: :| I |: 

Db 2331 QTDSEAQES TATSDETKALPLPEASVKTDTGTESKPQGVIRSPQGLELAL 2380 

Qy 611 PSAGASVIQ PSSSPLEASSVNYESIKHE-PEN — PPPYEEA-MSVSLKVSG 657 

II : I : I I I I I : : I : I : : I I : I : I I : I 

Db 2381 P S RD S EVLSAVADDS LAVS HKDS LEAS PVLEDNSSHKTPDSLEPSPLKESPCRDSLESSP 2440 

Qy 658 IKEEIKE PEN — INAALQETE — APYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

: : : : I I : : I I : : I I I : I : : : I I I : I : I 

Db 2441 VEPKMKAGIFPSHFPLPAAVAKTELLTEVASVRSRLLRD PDGS — AEDDSLE 2490 

Qy 711 QPVPDHSELVEDS SPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I : I : I I III: : I : | : : I | 
Db 2491 Q TSLMESSGKSPLSPDTPSSEEVSYEVTPKTTDVSTPKPAVIHECAEED 2539 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKIPLQMEELS 821 

: I I I I II III: I I : I I : : I : : I I I I I 

Db 2540 DSENGEKKRFTPEE EMFKMV TKIKMF-DELEQEAKQKRDYKKEPKQEESSS 2589 

Qy 822 TAVYSNDDLFISKEA-QIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 880 

: I: I | : : : | : | | : | |::|:: | : 
Db 2590 S SDPDADCSVDVDEPKHTGSGEDESGV PVLVTSESRKVSSSSES 2633 



Qy 881 HKSEIANAPDGAGS LPCTELPHDLSLKN IQPKVEEKISF — SDD 922 

: I : I Ml | : | | : : | | | : : : | : : | 

Db 2634 -EPELAQLKKGADSGLLPEPVIRVQPPSPLPSSMDSNSSPEEVQFQPWSKQYTFKMNED 2692 

Qy 923 FSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSD TEKEDRS 977 

: : : I I :: I I I I I I I I I 

Db 2693 TQEEPGKSEE EKDSESHLAEDRHAVSTEAEDRS 2725 



RESULT 14 
MAPB HUMAN 



ID MAP B_HUMAN STANDARD; PRT; 2468 AA. 

AC P46821; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) [Contains: MAPI light chain 

DE LCI] . 

GN MAP1B. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal brain; 

RX MEDLINE=95104835; PubMed=78 062 12 ; 

RA Lien L.L., Feener C, Fischbach N. , Kunkel L.M.; 

RT "Cloning of human microtubule-associated protein IB and the 

RT identification of a related gene on chromosome 15."; 

RL Genomics 22:273-280(1994). 

CC -!- FUNCTION: The function of brain MAPS is essentially unknown. 

CC Phosphorylated MAP1B may play a role in the cytoskeletal changes 

CC that accompany neurite extension. Possibly MAP1B Binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LCI, LC2 and LC3, can associate 

CC with MAPI A and MAP IB proteins. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 

CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LCI is coexpressed with MAPlB. It is a polypeptide generated 

CC from MAPlB by proteolytic processing. It is free to associate with 

CC both MAP1A and MAPlB. It interacts with the amino-terminal region 

CC of MAPlB (By similarity) . 

CC -!- SIMILARITY: TO MAP1A. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L06237; AAA18904.1; 



DR Genew; HGNC:6836; MAP1B . 

DR MIM; 157129; 

DR GO; GO: 0005875; C : microtubule associated complex; TAS . 

DR InterPro; IPR000102; MAPlB_neuraxin . 

DR Pfam; PF00414; MAP IB neuraxin; 10. 



DR PROSITE; PS00230; MAP1B_NEURAXIN; 6. 
KW Microtubule; Repeat; Phosphorylation. 
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2047 


MAP IB 9. 


FT 


REPEAT 


2048 


2064 


MAP1B 10. 


FT 


DOMAIN 


589 


790 


LYS-RICH (HIGHLY BASIC, CONTAINS MANY 


FT 








KKEE AND KKEl/V REPEATS) . 


SQ 


SEQUENCE 


2468 


AA; 270618 


MW; 540839CBDF09D461 CRC64; 



Query Match 5.0%; Score 2 96.5; DB 1; Length 24 68; 

Best Local Similarity 21.5%; Pred. No. 0.00049; 

Matches 2 64; Conservative 14 8; Mismatches 459; Indels 355; Gaps 59; 

Qy 2 EDLDQSPLVSSS-DSPPRPQPAFKYQ FVREP EDEE 35 

II:: 11:111 : : I : : : I I I I : 

Db 956 EDGEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRAEEDMDEAIEKGEAEQ 1015 

Qy 36 EEEEEEEEDEDEDLEELEV-LERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPL 94 

I I I : I I I : I I I I I : I I I I I I : I I I 

Db 1016 SEEEADEEDKAEDAREEEYEPEKMEAEDYVMAWDKAAEAGGAEEQYGFLTTPTKQLG — 1073 

Qy 95 PAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ 154 

: I I : I : I I : I I II hi I I : 

Db 1074 AQSPGREPA — SSIHDETLPGGSESEATA SDEENREDQPEEFTATSGYT 1120 



Qy 155 AEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQE D 2 09 

: II III : I : I I I I 

Db 1121 QSTIEISSEPTPMDEMSTP RDVMSDETNNEETESPSQEFVNITK 1164 



Qy 210 FPSVLLETAASXPS-LSPLSAASFKE HEYLGNLSTVLPTEGTLQENVSEAS 259 

: I I I I : : : I I : II : I : I I : I : : I : : 

Db 1165 YESSLYSQEYSKPADVTPLN — GFSEGSKTDATDGKDYNASASTISPPSSMEEDKFSRSA 1222 



Qy 260 KEVSEKAKTLLIDRDLTEF SELEYSEMGS SFS 291 

II II I I : I I I I : : I : I I 

Db 1223 LRDAYCSEV — KASTTLDIKDSISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFS 1280 



Qy 292 VSP KAESAVIVANPR--EEIIVKN--KDEEEKL VSNNILHX 32 8 

: : I I I : I : I : I : : : : I : : I I : : I 

Db 1281 LTPNEIKVSAEAEVAPVSPEVTQEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSP 1340 



Qy 329 QQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK-PFERVWEVK 383 

III: : I I I : I : : I I : I I I I : I I : I 
Db 1341 TDEKSSHLPTEV — I EKPPAVPVS FEFSDAKDENERAS VS PMDEPVPDSES P I EKVLS PL 1398 



Qy 384 DSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDR 443 

I : : : III: II I I I : : I I : 

Db 1399 RS PPLIGSESAYESFLSA — DDKASGRGAESPFEEKSGKQGSPDQVSPVSE 1447 

Qy 444 SGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKK IEEKK A 488 

: I I : | : | : | | | | I : : : : | : | 

Db 1448 MTSTSLYQDKQEGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPT 1503 

Qy 4 89 QI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWA — N 531 

II : : I I 1 : : | | : | | : : | : | | : 

Db 1504 QIDVSQFGSFKEDTKMSISEGTVSDKSATP — VD E GVAE DT Y S HME GVAS VS T AS VAT S S 1561 

Qy 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAAQLCP 584 

II III : : I : I : : : : I I I I I : : I : : I I 

Db 1562 FPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMSIS 1620 

Qy 585 SFEESEATPSPVLP-DIVMEAPLNSAVPSAGASVI 618 

II : : I I I : : I : I : I I I : 
Db 1621 PPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEQSLAMDFSRQSPDH PTVGAGVL 1677 

Qy 619 Q PSSSPLEASSVNYESIKHE — PENPPPY EEAMSVS-LKVSGIKEEIKE 664 

I : I : I : I : I II I : I I I : : I 

Db 1678 HITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTSSAHT 1737 

Qy 665 PENINAALQETE AP YISIACDLIKE TKLSAEP A 697 

I I : I I I II ||:::: | M : : 

Db 1738 PSQIASPLQEDTLSDVAPPRDMSLYASLTSEKVQSLEGEKLSPKSDISPLTPRESSPLYS 1797 

Qy 698 PDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 734 

I III: II: II I I I I : I I 
Db 1798 PTFSDSTSAVK-EKTATCHS SSSP PIDAASAEPYGFRASVLFDTMQHHLALNR 1849 

Qy 735 DDSIPDVP QKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

III: M : II I : I : I I : I I : 

Db 1850 DLSTPGLEKDSGGKTPGDFSYAYQKPEET TRSPDEEDYDYESYEKTTRTSDV 1901 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

I I I I : : : I : I I : I I I I I : I I : 

Db 1902 GGY-YYEKIERTTKSPSDSGYSYE— TIGKTTKTP EDGDYSYE — IIEKTTRTP 1950 

Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDS FSKLAREYTDLEVSHKSEIANAPDGAGSLP 896 

I : I I I I I I I : I I : : I : I I : 
Db 1951 EEGGYSYD 1 SEKTTSPPEVSGYSYEKTERSRRLLDDISNGYDDS 1994 

Qy 8 97 CTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE-IESIVK 955 

: I I : : I I I : : II III : I I I : 

Db 1995 -EDGGHTLGDPSYSYETTEKITSFPESEGYSYETSTKTTRTPDTSTYCYETAEKITRTPQ 2053 

Qy 956 PKVLEKE AEKKLPSDTEKE 974 

I I I I I I I : :: 

Db 2054 ASTYSYETSDLCYTAEKKSPSEARQD 2079 

RESULT 15 
MAPB_RAT 

ID MAPB_RAT STANDARD; PRT; 2459 AA. 



AC P15205; Q62958; Q9ER21; Q9QW92; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Micro tubule-as so ciated protein IB (MAP IB) (Neuraxin) [Contains: MAPI 

DE light chain LCI] . 

GN MAP1B. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE OF 1-142 FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Testis ; 

RX MEDLINE=96257242; PubMed=8666295 ; 

RA Liu D., Fischer I.; 

RT "Isolation and sequencing of the 5 1 end of the rat microtubule- 

RT associated protein (MAP1B) -encoding cDNA. " ; 

RL Gene 172:307-308(1996). 

RN [2] 

RP SEQUENCE OF 96-2459 FROM N.A., DOMAIN, AND INDUCTION. 

RC STRAIN=Sprague-Dawley; TISSUE=Brain, and Glial tumor; 

RX MEDLINE=92347374; PubMed=1639092 ; 

RA Zauner W., Kratz J., Staunton J., Feick P., Wiche G. ; 

RT "Identification of two distinct microtubule binding domains on 

RT recombinant rat MAP IB."; 

RL Eur, J. Cell Biol. 57:66-74(1992). 

RN [3] 

RP SEQUENCE OF 1541-2459 FROM N.A. , AND TISSUE SPECIFICITY. 

RC TISSUE=Spinal cord; 

RX MEDLINE=90059871; PubMed=2555150 ; 

RA Rienitz A., Grenningloh G. , Hermans-Borgmeyer I., Kirsch J., 

RA Littauer U.Z., Prior P., Gundelfinger E.D., Schmitt B., Betz H.; 

RT "Neuraxin, a novel putative structural protein of the rat central 

RT nervous system that is immunologically related to microtubule- 

RT associated protein 5-"; 

RL EMBO J. 8:2879-2888(1989). 

RN [4] 

RP DEVELOPMENTAL STAGE, AND PHOSPHORYLATION. 

RX MEDLINE=974 05699; PubMed=92 60743 ; 

RA Ma D., Nothias F . , Boyne L.J., Fischer I.; 

RT "Differential regulation of microtubule-associated protein IB (MAPlB) 

RT in rat CNS and PNS during development."; 

RL J. Neurosci. Res. 49:319-332(1997). 

CC -!- FUNCTION: The function of brain MAPS is essentially unknown. 
CC Phosphorylated MAPlB may play a role in the cytoskeletal changes 

CC that accompany neurite extension. Possibly MAPlB Binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LCI, LC2 and LC3, can associate 
CC with MAPI A and MAPlB proteins. 

CC -!- TISSUE SPECIFICITY: Nervous system (spinal cord, brain stem, 

CC cerebellum and cerebrum) . Not expressed in liver, spleen, kidney, 

CC heart or muscle. 

CC -!- DEVELOPMENTAL STAGE: In cerebral cortex, spinal cord and sciatic 
CC nerve levels are high early in development but decrease during 



CC postnatal development and are low in adults. In dorsal root 

CC ganglia levels remain high throughout development. 

CC -!- INDUCTION: By nerve growth factor. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 
CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP IB to microtubules. 

CC -!- PTM: LCI is coexpressed with MAP1B. It is a polypeptide generated 
CC from MAP1B by proteolytic processing. It is free to associate with 

CC both MAPI A and MAPlB. It interacts with the amino-terminal region 

CC of MAPlB (By similarity) . 

CC -!- PTM: Phosphorylated. 

CC -!- SIMILARITY: TO MAP 1A. 

CC -!- CAUTION: A C-terminal fragment of this protein (residues 1597 to 
CC 2459) was originally described as neuraxin in Ref.3. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U52950; AAB17068.1; -. 

DR EMBL; X60370; CAC16162.1; -. 

DR EMBL; XI 662 3; CAA34 620.1; ALT_SEQ. 

DR PIR; A56577; A56577. 

DR InterPro; IPR000102; MAPlB_neuraxin . 

DR Pfam; PF00414; MAPlB_neuraxin; 10. 

DR PROSITE; PS00230; MAP1B_NEURAXIN; 8. 

KW Microtubule; Repeat; Phosphorylation. 
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? 
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MAPI LIGHT CHAIN LCI. 
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MAPlB 1. 
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MAPlB 2. 
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MAPlB 3. 


FT 


REPEAT 
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MAPlB 4. 
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1937 
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MAPlB 5. 


FT 


REPEAT 
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1970 


MAPlB 6. 


FT 


REPEAT 
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MAPlB 7. 


FT 


REPEAT 
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2021 


MAPlB 8. 


FT 


REPEAT 


2022 


2038 


MAPlB 9. 


FT 


REPEAT 


2039 


2055 


MAPlB 10. 


FT 


DOMAIN 


559 


1035 


GLU-RICH. 


FT 


DOMAIN 


588 


786 


LYS-RICH (HIGHLY BASIC, CONTAINS MANY 


FT 








KKEE AND KKEI/V REPEATS) . 


FT 


DOMAIN 


2224 


2312 


LYS-RICH. 


FT 


CONFLICT 


127 


127 


M -> V (IN REF. 1) . 


FT 


CONFLICT 


140 


140 


T -> S (IN REF. 1) . 


FT 


CONFLICT 


2112 


2112 


R -> K (IN REF. 3) . 


FT 


CONFLICT 


2169 


2169 


L -> I (IN REF. 3) . 


SQ 


SEQUENCE 


2459 


AA; 269497 


MW; 2E3F6872DEDB8BA2 CRC64; 



Query Match 4.9%; Score 289; DB 1; Length 2459; 

Best Local Similarity 20.9%; Pred. No. 0.00094; 

Matches 275; Conservative 158; Mismatches 462; Indels 422; Gaps 61; 



Qy 



30 EPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPA 89 



If I I I I I I I • • • II • • I II I II - II. 
Db 1004 EAEQSEEEGEEEEDKAEDAREEDHEPDKTEAEDYVMAWDKAAEAGVTEDQY — DFL 1058 

Qy 90 PRGPLPAAPP — VAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 147 

II I : I I : I : I I : I I II I : I | 

Db 1059 GTPAKQPGVQSPSREPA — SSIHDETLPGGSESEATA SDEENREDQPEEF 1106 

Qy 148 PAS VS PQAE PWT P PAPAPAAP P ST PAAP KRRGS S GAWXXXXKIMDLKEQPGNT I S AGQ 207 

I : : I I I I I : I : Ml 

Db 1107 TATSGYTQSTIEISSEPTPMDEMSTP RDVMTDETNNEETES PSQ 1150 

Qy 208 E DFPSVLLETAASXP SLSPLSAASFKE HEYLGNLSTVLPTEGTLQENV 255 

I : I I II I : I I I : : I : I I : I : : 

Db 1151 EFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTISPPSSMEEDKF 1210 

Qy 256 SEAS KEVSEK AKTLLIDRDLTE 277 

I : :: I : I I : : I I I : I : 

Db 1211 SKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-N 1269 

Qy 278 FS ELEYSEMGSSFS-VSPKAESAVT VAN PRE EIIVKNK 314 

II I : : I I : : I I I I I : I : I I I : : : : 

Db 1270 FSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYY 1329 

Qy 315 DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSE — KAKDS FNEKRVAVEAPMRE 368 

II: : : 1:1 I I I : I I I I I : : : I I I 

Db 1330 QSPTDEKSSHLPTEVTENAQAVP VSFEFTEAKDE-NER — SSISPMDE 1374 

Qy 369 EYADFK-PFERVWE VKDSKED SDMLAAGGKIESNLESKVDKKCFADSL 415 

I : I I : I : : I I : I I I : I I I I I : I : I 

Db 1375 PVPDSESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSD — 1432 

Qy 416 EQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNP AATESIATNIFPLLED- 471 

I : I I I I I : : I : I I : I I : I : : II 

Db 1433 KESPVS-DLTSDLYQDKQEEKRAGFIPIKEDFSPEKKASDAEIMSSQSALALDE 1485 

Qy 472 PTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

II:: : I : : I I I : : I I : I : I : : 
Db 1486 RKLGGDGSPTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT YSHMEGVAS 1542 

Qy 524 VTEEWA — NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAA 58 0 

I : II : I I III : : I : I : : : : I I I I I : : I : 

Db 1543 VSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSK 1601 

Qy 581 QLCP SFEESEATPSPVLPDIVMEAPLNSAV P 611 

: I I :: : I : : t : I : I I 

Db 1602 EECPRPMSISPPDFSPKTAKSRT PVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHP 1661 

Qy 612 SAGASVIQ PSSSPLEASSVNYESIKHE — PENPPPY EEAMSVS-LKVSG 657 

: I I : : I : I : I : I : I l| I : I I I : : I 

Db 1662 TVGAGMLHITENGPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLSELISVSQVEASP 1721 

Qy 658 IKEEIKEPENINAALQETE APYISIACDLIKE TKLSAEP 696 

I I : III : I |:| : :: ||| : 

Db 1722 STSSAHTPSQIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKLSPKSDISPLTPR 1781 

Qy 697 APDFSDYSEMAK VEQPVPDHSELVEDS 723 

: I I I I : I I : I I :: I : 



Db 1782 ESSPTYSPGFSDSTSGAKESTAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQHHLALSRD 1841 

Qy 724 SPDSEPVDLFSDD SIPDVP QKQD 746 

I I I I I I : || : | : 

Db 1842 LTTSSVEKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYESHEKTIQAHDVGGYYYEKTE 1901 

Qy 747 ETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKD 797 

I : I I : I : I II : I III I : : : 

Db 1902 RTIKSPCDSGYSYETIEKTTKTPEDGGYSCEITEKTTRTPEEGGYSY EISEK 1953 

Qy 798 TLLPDEVS — TLSKKEKI PLQMEELSTAVYSNDD LFISKEA 836 

I llllll: : : : : | : I | | : 

Db 1954 TTRTPEVS GYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESES 2013 

Qy 837 QIRETETFSDSSP IEIIDEFPTLISSKTDSFSKLAREYT DL 877 

I I I : I I I :: : : I I : | | | | | 

Db 2014 YSYETTTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTPERKSPSEARQDVDL 2073 

Qy 878 EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNIQP KV 913 

I I i : I : : : I : I I I I I : I 

Db 2074 CLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESERPLTQSGGAPPPSGGKQQGRQC 2133 

Qy 914 EEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKK-LPSD — 970 

:| I I II II: III I: I I : :|:| 

Db 2134 DETPPTSVSESAPSQTDSDV PPETE ECPSITADANLDSEDESETI PTDKT 2183 

Qy 971 TEKEDRSPS AIFSADLGKTSWDLLYWRDIKKTG 1004 

: I I I I I : I I I II III 

Db 2184 VTYKHMDPPPAPMQDRSPSPRHPDVSMVDPEALAIEQNLGKALKKDLKEKAKTKKPG 2240 

Search completed: September 3, 2004, 16:06:11 
Job time : 24.6624 sees 



